Description

COMPOSITIONS CONTAINING NUCLEIC 
ACIDS AND LIGANDS FOR THERAPEUTIC TREATMENT 


Technical Field 

The present invention relates generally to the treatment of diseases, and
more specifically, to the preparation and use of complexes containing receptor-binding
internalized ligands, nucleic acid binding domains (NABD) and cytocide-encoding agents
to alter the function, gene expression, or viability of a cell in a therapeutic manner.

Background of the Invention 

A major goal of treatment of neoplastic diseases and hyperproliferative
disorders is to ablate the abnormally growing cells while leaving normal cells
untouched. Various methods are under development for providing treatment, but none
provides the requisite degree of specificity.

One method of treatment is to provide toxins. Immunotoxins and
cytotoxins are protein conjugates of toxin molecules with either antibodies or factors
which bind to receptors on target cells. Three major problems may limit the usefulness
of immunotoxins. First, the antibodies may react with more than one cell surface
molecule, thereby effecting delivery to multiple cell types, possibly including normal
cells. Second, even if the antibody is specific, the antibody-reactive molecule may be
present on normal cells. Third, the toxin molecule may be toxic to cells prior to
delivery and internalization. Cytotoxins suffer from similar disadvantages of specificity
and toxicity. Another limitation in the therapeutic use of immunotoxins and cytotoxins
is the relatively low ratio of therapeutic to toxic dosage. Additionally, it may be
difficult to direct sufficient concentrations of the toxin into the cytoplasm and
intracellular compartments in which the agent can exert its desired activity.

Given these limitations, cytotoxic therapy has been attempted using viral
vectors to deliver DNA encoding the toxins into cells. If eukaryotic viruses are used,
such as the retroviruses currently in use, they may recombine with host DNA to produce
infectious virus. Moreover, because retroviral vectors are often inactivated by the
complement system, use in vivo is limited. Retroviral vectors also lack specificity in
delivery; receptors for most viral vectors are present on a large fraction of cells, if not all.
Thus, such a viral vector will infect normal as well as abnormal cells.
Because of this general infection mechanism, it is not desirable for the viral vector to
directly encode a cytotoxic molecule.

While delivery of nucleic acids offers advantages over delivery of 
cytotoxic proteins such as reduced toxicity prior to internalization, there is a need for 
high specificity of delivery, which is currently unavailable with the present systems. 

In view of the problems associated with gene therapy, there is a
compelling need for improved treatments which are more effective and are not
associated with such disadvantages. The present invention exploits the use of
conjugates which have increased specificity and deliver higher amounts of nucleic acids
to targeted cells, while providing other related advantages.

Summary of the Invention

The present invention generally provides therapeutic compositions. In
one aspect, the composition has the formula: receptor-binding internalized
ligand-nucleic acid binding domain-cytocide-encoding agent. The receptor-binding
internalized ligand is a polypeptide reactive with a cell surface receptor, the nucleic acid
binding domain binds to a nucleic acid, the cytocide-encoding agent is a nucleic acid
molecule which encodes a cytocide and binds to the nucleic acid binding domain, and
the composition binds to the cell surface receptor and internalizes the cytocide-encoding
agent in cells bearing the receptor. In another aspect, the composition has the formula:
receptor-binding internalized ligand-nucleic acid binding domain-prodrug-encoding
agent.

In certain embodiments, the receptor-binding internalized ligand is a
polypeptide reactive with an FGF receptor, VEGF receptor, HBEGF receptor, or a
cytokine. In other embodiments, the cytocide-encoding agent encodes a protein that
inhibits protein synthesis and is preferably a ribosome inactivating protein, most
preferably saporin. The protein is gelonin or diphtheria toxin in other embodiments. In
other embodiments, the prodrug-encoding agent encodes HSV-thymidine kinase.






WO 96/36362 PCT/US96/07164



The nucleic acid binding domain is poly-L-lysine in one embodiment. In
other embodiments, the nucleic acid binding domain is a transcription factor selected
from the group consisting of helix-turn-helix motif proteins, homeodomain proteins,
zinc finger motif proteins, steroid receptor proteins, leucine zipper motif proteins,
helix-loop-helix motif proteins, and β-sheet motif proteins. In other embodiments, the
nucleic acid binding domain binds nonspecifically to nucleic acids and is selected from
the group consisting of poly-L-lysine, protamine, histone and spermine. In a preferred
embodiment, the nucleic acid binding domain binds the coding region of a ribosome
inactivating protein such as saporin. In another preferred embodiment, FGF is
conjugated to poly-L-lysine.

In yet other embodiments, the cytocide-encoding agent contains a
tissue-specific promoter, such as alpha-crystalline, gamma-crystalline, α-fetoprotein, CEA,
prostate-specific antigen, erbB-1, tyrosinase, α-actin, c-myc, VEGF receptor, FGF
receptor or cyclin D.

In another aspect, the composition also contains a linker. In various
embodiments, the linker increases the flexibility of the conjugate and is (Gly_mSer_p)_n or
(Ala Ala Pro Ala)_n, wherein n is 1 to 6, m is 1 to 6 and p is 1 to 4, or the linker is a
disulfide bond.
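
By way of illustration only, the peptide linker formulas recited above can be enumerated programmatically. The following Python sketch is not part of the original disclosure: the function names and the use of one-letter amino acid codes (G, S, A, P) are assumptions made for the example, and the index ranges are those stated in the text (n is 1 to 6, m is 1 to 6, p is 1 to 4).

```python
from itertools import product


def gly_ser_linker(m: int, p: int, n: int) -> str:
    """Return the one-letter sequence of a (Gly_m Ser_p)_n linker."""
    if not (1 <= m <= 6 and 1 <= p <= 4 and 1 <= n <= 6):
        raise ValueError("expected m in 1..6, p in 1..4 and n in 1..6")
    # m glycines followed by p serines, with the whole unit repeated n times.
    return ("G" * m + "S" * p) * n


def ala_ala_pro_ala_linker(n: int) -> str:
    """Return the one-letter sequence of an (Ala Ala Pro Ala)_n linker."""
    if not 1 <= n <= 6:
        raise ValueError("expected n in 1..6")
    return "AAPA" * n


def all_gly_ser_linkers() -> dict:
    """Map every allowed (m, p, n) combination to its linker sequence."""
    ranges = product(range(1, 7), range(1, 5), range(1, 7))
    return {(m, p, n): gly_ser_linker(m, p, n) for m, p, n in ranges}
```

For example, gly_ser_linker(4, 1, 3) returns GGGGSGGGGSGGGGS, a 15-residue linker, and the stated ranges admit 6 x 4 x 6 = 144 combinations of m, p and n.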

In yet another aspect, the composition has the formula: receptor-binding
internalized ligand-cytocide-encoding agent-nucleic acid binding domain, wherein the
receptor-binding internalized ligand is conjugated to the cytocide-encoding agent,
which is bound to the nucleic acid binding domain to form a complex.

In other aspects, the invention provides methods for preventing excessive
cell proliferation in the anterior eye following surgery, treating corneal clouding
following excimer laser surgery, preventing closure of a trabeculectomy, preventing
pterygii recurrence, treating hyperproliferative diseases in the back of the eye, such as
macular degeneration, diabetic retinopathy and proliferative vitreal retinopathy,
treating smooth muscle cell hyperplasia after a wound healing response to a procedure,
e.g., vein grafting, endarterectomies and arteriovenous shunts, and treating cancer. In
these aspects, an effective amount of the compositions described above is
administered.

Brief Description of the Drawings

Figure 1 is a photograph of an SDS-PAGE of FGF2-K152 under
non-reducing (left) and reducing (right) conditions. Lane 1, FGF2-K152; lane 2, FGF2;
lane 3, FGF2-K152; lane 4, FGF2. The open arrow identifies material unable to enter
the gel. The closed arrow identifies a protein band corresponding to FGF2.

Figure 2 is a graph depicting the proliferation of bovine aortic
endothelial cells in response to FGF2 (closed box) and FGF2-K152 (open circle)
conjugate.

Figure 3 is a photograph of a gel showing the effects of various lengths
of poly-L-lysine on the ability to interact with DNA. Thirty-five ng of labeled DNA
was added to increasing concentrations of either FGF2 or FGF2-K: lanes 1, 0 ng;
lanes 2, 0.1 ng; lanes 3, 1 ng; lanes 4, 10 ng; lanes 5, 20 ng; lanes 6, 35 ng; lanes 7,
100 ng. Panel A, FGF2; panel B, FGF2-K152; panel C, FGF2-K13; panel D, FGF2-K84;
panel E, FGF2-K267; panel F, FGF2-K39. The lengths of the digested DNA are
indicated.

Figure 4 is a chart depicting the activity of β-gal following transfection
of FGF2/poly-L-lysine/DNAβ-gal into COS cells. Lane 1, 10:1 (w/w) ratio of
FGF2/poly-L-lysine conjugate to DNA; lane 2, 5:1 ratio; lane 3, 2:1 ratio; lane 4, 1:1
ratio; lane 5, 0.5:1 ratio. The five bars, from left to right, are FGF2, FGF2-K13,
FGF2-K39, FGF2-K84, and FGF2-K152.

Figure 5 is a pair of photographs of toroid formation observed by electron
microscopy. The upper panel shows an example of a toroid; the lower panel shows an
incomplete toroid.

Figure 6 is a graph depicting proliferation of bovine aortic endothelial
cells. In the upper panel, cells were treated with FGF2-K152-DNA; in the lower panel,
cells were treated with a mixture of FGF2, K152, and DNA.

Figure 7A is a graph displaying β-gal activity after transfection of
FGF2/poly-L-lysine/pSVβ-gal into COS cells (lane 1), B16 cells (lane 2), NIH 3T3
cells (lane 3), and BHK cells (lane 4).

Figure 7B is a graph depicting β-gal expression in COS cells. pSVβ-gal
(lanes 1, 3) or pNASSβ-gal (lanes 2, 4) was incubated with (lanes 1, 2) or without
(lanes 3, 4) FGF2-K84, and the complexes were incubated on COS cells for 48 hrs.

Figure 7C is a graph showing β-gal activity at various times
following transfection with either plasmid alone or with complexes of
FGF2/K84/pSVβ-gal. DNA alone and FGF2-K84-DNA are plotted as separate curves.

Figure 7D is a graph showing β-gal activity after transfection of various
concentrations of FGF2/K84/pSVβ-gal. Lane 1, 0 µg; lane 2, 0.1 µg; lane 3, 1 µg; lane 4,
5 µg; lane 5, 10 µg.

Figure 8A is a graph showing β-gal activity in COS cells following
transfection of FGF2-K84-pSVβ-gal (lane 1), FGF2+K84+pSVβ-gal (lane 2),
[illegible] (lane 3), [illegible]+pSVβ-gal (lane 4), pSVβ-gal (lane 5), [illegible]
(lane 6), FGF2 (lane 7) and K84 (lane 8).

Figure 8B is a graph showing competition for cell binding. Lane 1,
FGF2-K84-pSVβ-gal complex transfected into COS cells; lane 2, FGF2-K84-pSVβ-gal
plus 100 µg FGF2; lane 3, no complex.

Figure 8C is a graph showing the attenuation of β-gal activity upon the
addition of heparin during transfection. Lane 1, FGF2-K84-pSVβ-gal + 10 µg heparin;
lane 2, FGF2-K84-pSVβ-gal; lane 3, heparin alone; lane 4, pSVβ-gal alone.

Figure 8D is a graph showing ligand targeting of DNA. pSVβ-gal DNA
alone (lane 1), or pSVβ-gal DNA condensed with FGF2-K84 (lane 2), histone H1-K84
(lane 3) or cytochrome C-K84 (lane 4), was added to BHK cells. β-gal activity
was measured 48 hr later.

Figure 9A is a graph showing the effect of chloroquine on β-gal
expression. pSVβ-gal and FGF2-K84 were mixed in the absence (lane 1) or presence
(lane 2) of 100 µM chloroquine and incubated for 1 hr at room temperature prior to
addition of the complexes to COS cells. Lane 3, chloroquine alone; lane 4, DNA alone.

Figure 9B is a graph showing the effect of endosome disruptive peptide (EDP)
on β-gal expression. Lane 1, control; lane 2, FGF2-K84-pSVβ-gal; lane 3,
FGF2-K84-pSVβ-gal+EDP.

Figure 9C is a pair of photographs of cells stained for β-gal activity following
transfection of COS cells with (right panel) or without (left panel) endosome disruptive
peptide and FGF2-K84-pSVβ-gal.

Figure 10 is a photograph of a fluorograph analyzing cell-free translation
products. Lane 1, no RNA; lane 2, saporin RNA; lane 3, luciferase RNA; lane 4,
saporin RNA and luciferase RNA; lane 5, saporin RNA followed 30 min later with
luciferase RNA.

Figure 11 is a graph depicting direct cytotoxicity in cells transfected by
CaPO4 with an expression vector encoding saporin. Lane 1, mock transfection; lane 2,
transfection with pSVβ-gal; lane 3, transfection with saporin-containing vector.

Figure 12 is a pair of graphs showing cytotoxicity of cells transfected
with FGF2-K84-pSVSAP. Left panel, BHK21 cells; right panel, NIH 3T3 cells. Lane
1, FGF2-K84-pSVβ-gal; lane 2, FGF2-K84-pSVSAP.

Figure 13A is a graph showing β-gal activity with an endosome
disruptive peptide in the complex.

Figure 13B is a graph showing β-gal activity with an endosome
disruptive peptide in the complex.

Figure 13C is a graph showing β-gal activity with an endosome
disruptive peptide in the complex.

Detailed Description of the Invention 
Definitions

All U.S. patents and all publications mentioned herein are incorporated 
in their entirety by reference thereto. 

The "amino acids," which occur in the various amino acid sequences
appearing herein, are identified according to their well-known three-letter or one-letter
abbreviations. The nucleotides, which occur in the various DNA fragments, are
designated with the standard single-letter designations used routinely in the art.

As used herein, to "bind to a receptor" refers to the ability of a ligand to
specifically recognize and detectably bind to such receptors, as assayed by standard
in vitro assays. For example, as used herein, binding measures the capacity of a VEGF
conjugate, VEGF monomer, or VEGF dimer to recognize a VEGF receptor on a
vascular endothelial cell, such as an aortic vascular endothelial cell line, using a
procedure substantially as described in Moscatelli, J. Cell Physiol. 131:123-130, 1987.

As used herein, "biological activity" refers to the in vivo activities of a
compound or physiological responses that result upon in vivo administration of a
compound, composition or other mixture. Biological activity thus encompasses
therapeutic effects and pharmaceutical activity of such compounds, compositions and
mixtures. Such biological activity may be defined with reference to particular in vitro
activities as measured in a defined assay. For example, reference herein to the
biological activity of FGF, or fragments of FGF, refers to the ability of FGF to bind to
cells bearing FGF receptors and internalize a linked agent. Such activity is typically
assessed in vitro by linking the FGF to a cytotoxic agent, such as saporin, contacting
cells bearing FGF receptors, such as fibroblasts, with the conjugate and assessing cell
proliferation or growth. In vivo activity may be determined using recognized animal
models, such as the mouse xenograft model for anti-tumor activity (see, e.g., Beitz et
al., Cancer Research 52:227-230, 1992; Houghton et al., Cancer Res. 42:535-539,
1982; Bogden et al., Cancer (Philadelphia) 48:10-20, 1981; Hoogenhout et al., Int. J.
Radiat. Oncol. Biol. Phys. 9:871-879, 1983; Stastny et al., Cancer Res. 53:5740-5744,
1993).

As used herein, reference to the "biological activity of a
cytocide-encoding agent," such as DNA encoding saporin, refers to the ability of such agent to
interfere with the metabolism of the cell by inhibiting protein synthesis. Such
biological or cytotoxic activity may be assayed by any method known to those of skill
in the art including, but not limited to, in vitro assays that measure protein synthesis and
in vivo assays that assess cytotoxicity by measuring the effect of a test compound on
cell proliferation or on protein synthesis. Assays that assess cytotoxicity in targeted
cells are particularly preferred.

As used herein, a "conjugate" refers to a molecule that contains at least
one receptor-binding internalized ligand and at least one nucleic acid binding domain
that are linked directly or via a linker and that are produced by chemical coupling
methods or by recombinant expression of chimeric DNA molecules to produce fusion
proteins.

A "cytocide-encoding agent" is a nucleic acid molecule that encodes a
protein that inhibits protein synthesis. Such a protein may act by cleaving rRNA or
ribonucleoprotein, inhibiting an elongation factor, cleaving mRNA, or other mechanism
that reduces protein synthesis to a level such that the cell cannot survive. The
cytocide-encoding agent may contain additional elements besides the cytocide gene. Such
elements include a promoter, enhancer, splice sites, transcription terminator, poly(A)
signal sequence, bacterial or mammalian origins of replication, selection markers, and
the like.

As used herein, the term "cytotoxic agent" refers to a molecule capable
of inhibiting cell function. The agent may inhibit proliferation or may be toxic to cells.
A variety of cytotoxic agents can be used and include those that inhibit protein
synthesis and those that inhibit expression of certain genes essential for cellular growth
or survival. Cytotoxic agents include those that result in cell death and those that
inhibit cell growth, proliferation and/or differentiation.

As used herein, cytotoxic agents include, but are not limited to, saporin,
the ricins, abrin and other ribosome inactivating proteins (RIPs), aquatic-derived
cytotoxins, Pseudomonas exotoxin, inhibitors of DNA, RNA or protein synthesis, such
as antisense nucleic acids, other metabolic inhibitors, such as DNA cleaving molecules,
prodrugs, such as thymidine kinase from HSV and bacterial cytosine deaminase, and
light activated porphyrin. While saporin is the preferred RIP, other suitable RIPs
include ricin, ricin A chain, maize RIP, gelonin, diphtheria toxin, diphtheria toxin A
chain, trichosanthin, tritin, pokeweed antiviral protein (PAP), mirabilis antiviral protein
(MAP), Dianthins 32 and 30, abrin, momordin, bryodin, shiga, a catalytic inhibitor of
protein biosynthesis from cucumber seeds (see, e.g., WO 93/24620), Pseudomonas
exotoxin, biologically active fragments of cytotoxins and others known to those of skill
in this art. Suitable cytotoxic agents also include cytotoxic molecules that inhibit
cellular metabolic processes, including transcription, translation, biosynthetic or
degradative pathways, DNA synthesis, and other such processes that kill cells or inhibit
cell proliferation.

"Heparin-binding growth factor" refers to any member of a family of
heparin-binding growth factor proteins, in which at least one member of the family
binds heparin. Preferred growth factors in this regard include FGF, VEGF, and
HBEGF. Such growth factors encompass isoforms, peptide fragments derived from a
family member, splice variants, and single or multiple exons, some forms of which may
not bind heparin.

As used herein, to "hybridize" under conditions of a specified stringency
is used to describe the stability of hybrids formed between two single-stranded nucleic
acid molecules. Stringency of hybridization is typically expressed in conditions of ionic
strength and temperature at which such hybrids are annealed and washed. Typically
high, medium and low stringency encompass the following conditions or equivalent
conditions thereto:

1) high stringency: 0.1 x SSPE or SSC, 0.1% SDS, 65°C
2) medium stringency: 0.2 x SSPE or SSC, 0.1% SDS, 50°C
3) low stringency: 1.0 x SSPE or SSC, 0.1% SDS, 50°C.
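
For convenience, the three stringency conditions listed above may be held in a small lookup table. The Python sketch below is illustrative only and is not part of the original disclosure: the names WashCondition, STRINGENCY and describe are hypothetical, and the temperatures are read as 65°C, 50°C and 50°C for high, medium and low stringency, respectively.

```python
from typing import NamedTuple


class WashCondition(NamedTuple):
    salt_x: float       # SSPE or SSC strength, as a multiple of the 1x buffer
    sds_percent: float  # SDS concentration, in percent
    temp_c: int         # temperature, in degrees Celsius


# Values transcribed from the list above (assumed reading of the temperatures).
STRINGENCY = {
    "high": WashCondition(salt_x=0.1, sds_percent=0.1, temp_c=65),
    "medium": WashCondition(salt_x=0.2, sds_percent=0.1, temp_c=50),
    "low": WashCondition(salt_x=1.0, sds_percent=0.1, temp_c=50),
}


def describe(level: str) -> str:
    """Format one stringency level in the notation used in the text."""
    c = STRINGENCY[level]
    return f"{c.salt_x:.1f} x SSPE or SSC, {c.sds_percent:.1f}% SDS, {c.temp_c}°C"
```

In this table, stringency rises as the salt concentration falls and as the temperature rises, so the high stringency entry has the lowest salt value and the highest temperature.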
"Nucleic acid binding domain" (NABD) refers to a molecule, usually a 
protein, polypeptide, or peptide (but may also be a polycation) that binds nucleic acids, 
such as DNA or RNA. The NABD may bind to single or double strands of RNA or 
DNA or mixed RNA/DNA hybrids. The nucleic acid binding domain may bind to a
specific sequence or bind irrespective of the sequence. 

As used herein, "nucleic acids" refer to RNA or DNA that are intended
for internalization into a cell and include, but are not limited to, DNA encoding a
therapeutic protein, DNA encoding a cytotoxic protein, DNA encoding a prodrug, DNA
encoding a cytocide, the complement of these DNAs, an antisense nucleic acid and
other such molecules. Reference to nucleic acids includes duplex DNA, single-stranded
DNA, RNA in any form, including triplex, duplex or single-stranded RNA, anti-sense
RNA, polynucleotides, oligonucleotides, single nucleotides, chimeras, and derivatives
thereof.

Nucleic acids may be composed of the well-known deoxyribonucleotides
and ribonucleotides composed of the bases adenine, cytosine, guanine, thymine, and
uracil. As well, various other nucleotide derivatives and non-phosphate backbones or
phosphate-derivative backbones may be used. For example, because normal
phosphodiester oligonucleotides (referred to as PO oligonucleotides) are sensitive to
DNA- and RNA-specific nucleases, several resistant types of oligonucleotides have
been developed in which the phosphate group has been altered to a phosphotriester,
methylphosphonate, or phosphorothioate (see U.S. Patent No. 5,218,088).

As used herein, "operative linkage" or operative association of
heterologous DNA to regulatory and effector sequences of nucleotides, such as
promoters, enhancers, transcriptional and translational stop sites, refers to the functional
relationship between such DNA and such sequences of nucleotides. For example,
operative linkage of heterologous DNA to a promoter refers to the physical and
functional relationship between the DNA and the promoter such that the transcription of
such DNA is initiated from the promoter by an RNA polymerase that specifically
recognizes, binds to and transcribes the DNA in reading frame.

As used herein, the term "polypeptide reactive with an FGF receptor"
refers to any polypeptide that specifically interacts with an FGF receptor, preferably the
high-affinity FGF receptor, and that is transported into the cell by virtue of its
interaction with the FGF receptor. Polypeptides reactive with an FGF receptor are also
called FGF proteins. Such polypeptides include, but are not limited to, FGF-1 to FGF-9.
For example, bFGF (FGF-2) should be generally understood to refer to polypeptides
having substantially the same amino acid sequences and receptor-targeting activity as
that of bovine bFGF or human bFGF. It is understood that differences in amino acid
sequences can occur among FGFs of different species as well as among FGFs from
individual organisms or species. In addition, chimeras or hybrids of any of FGF-1
through FGF-9, or FGFs that have deletions (see, e.g., PCT Application No. WO
90/02800), insertions or substitutions of amino acids are within the scope of FGF
proteins, as long as the resulting peptide or protein specifically interacts with an FGF
receptor and is internalized by virtue of this interaction.

As used herein, a "prodrug" is a compound that metabolizes or otherwise
converts an inactive, nontoxic compound to a biologically, pharmaceutically,
therapeutically, or toxic active form of the compound. A prodrug may also be a
pharmaceutically inactive compound that is modified upon administration to yield an
active compound through metabolic or other processes. The prodrug may alter the
metabolic stability or the transport characteristics of a drug, mask side effects or
toxicity, or improve or alter other characteristics or properties of a drug. By virtue of
knowledge of pharmacodynamic processes and drug metabolism in vivo, those of skill
in this art, once a pharmaceutically active compound is known, can design inactive
forms of the compound (see, e.g., Nogrady, Medicinal Chemistry: A Biochemical
Approach, Oxford University Press, New York, pages 388-392, 1985).

As used herein, "receptor-binding internalized ligand" or "ligand" refers
to any peptide, polypeptide, protein or non-protein, such as a peptidomimetic, that is
capable of binding to a cell-surface molecule and is internalized. Within the context of
this invention, the receptor-binding internalized ligand is conjugated to a nucleic acid
binding domain, either as a fusion protein or through chemical conjugation, and is used
to deliver a cytocide-encoding or prodrug-encoding agent to a cell. In one aspect, the
ligand is directly conjugated to a nucleic acid molecule, which may be further
complexed with a nucleic acid binding domain. Such ligands include growth factors,
cytokines, antibodies or fragments thereof, hormones, and the like.

As used herein, "saporin" (abbreviated herein as SAP) refers to
polypeptides that are isolated from the leaves or seeds of Saponaria officinalis, as well
as modified forms that have amino acid substitutions, deletions, insertions or additions,
which still express substantial ribosome inactivating activity. Purified preparations of
saporin are frequently observed to include several molecular isoforms of the protein. It
is understood that differences in amino acid sequences can occur in saporin from
different species as well as between saporin molecules from individual organisms of the
same species. Saporin for use herein may be purified from leaves, chemically
synthesized, or synthesized by expression of DNA encoding a saporin polypeptide.

As used herein, a "targeted agent" is a nucleic acid molecule that is
intended for internalization by complexing or linkage to a receptor-binding internalized
ligand and nucleic acid binding domain, and that upon internalization in some manner
alters or affects cellular metabolism, growth, activity, viability or other property or
characteristic of the cell.

As used herein, a "therapeutic nucleic acid" describes any nucleic acid
molecule used in the context of the invention that modifies gene transcription or
translation. This term also includes nucleic acids that bind to sites on proteins. It
includes, but is not limited to, the following types of nucleic acids: nucleic acids
encoding a protein, antisense RNA, DNA intended to form triplex molecules,
extracellular protein binding oligonucleotides, and small nucleotide molecules. A
therapeutic nucleic acid may be used to effect genetic therapy by serving as a
replacement for a defective gene, by encoding a therapeutic product, such as TNF, or by
encoding a cytotoxic molecule, especially an enzyme, such as saporin. The therapeutic
nucleic acid may encode all or a portion of a gene, and may function by recombining
with DNA already present in a cell, thereby replacing a defective portion of a gene. It
may also encode a portion of a protein and exert its effect by virtue of co-suppression of
a gene product.

Preparation Of Receptor-Binding Internalized Ligand, Nucleic Acid Binding 
Domain And Cytocide-Encoding Agent Complexes 

As noted above, the present invention provides cytocide-encoding agents
complexed with a conjugate of a receptor-binding internalized ligand and a nucleic acid
binding domain. Upon binding to an appropriate receptor, the complex is internalized
by the cell and is trafficked through the cell via the endosomal compartment, where at
least a portion of the complex may be cleaved.

A. Receptor-binding internalized ligands

As noted above, receptor-binding internalized ligands are used to deliver
a cytocide-encoding agent to a cell expressing an appropriate receptor on its cell
surface. Numerous molecules that bind specific receptors have been identified and are
suitable for use in the present invention. Such molecules include growth factors,
cytokines, and antibodies. Many growth factors and families of growth factors share
structural and functional features and may be used in the present invention. One such
family of growth factors specifically binds to heparin. The ability of heparin-binding
growth factors to interact with heparin appears in general to be a reflection of a
physiologically more relevant interaction occurring in vivo between these factors and
heparan sulfate proteoglycan molecules, which are found on the surface of cells and in
extracellular matrix. Heparin-binding growth factors include the fibroblast growth
factors FGF-1 through FGF-9, vascular endothelial growth factor (VEGF), and
heparin-binding epidermal growth factor (HBEGF). Antibodies that are specific to cell surface
molecules expressed by a selected cell type are readily generated as monoclonals or
polyclonal antisera. Many such antibodies are available (e.g., American Type Culture
Collection, Rockville, MD). Other growth factors, such as PDGF (platelet-derived
growth factor), EGF (epidermal growth factor), TGF-α (transforming growth factor), TGF-β,
IGF-I (insulin-like growth factor), and IGF-II also bind to specific identified receptors
on cell surfaces and may be used in the present invention. Cytokines, including
interleukins, CSFs (colony stimulating factors), and interferons, have specific receptors,
which are mostly found on hematopoietic cells, and may be used as described herein.
These ligands are discussed in more detail below.

Fragments of these ligands may be used within the present invention, so
long as the fragment retains the ability to bind to the appropriate cell surface molecule.
Likewise, ligands with substitutions or other alterations, but which retain binding
ability, may also be used.






1. Fibroblast growth factors 

One family of growth factors that has a broad spectrum of activities is
the fibroblast growth factor (FGF) family. These proteins share the ability to bind to
heparin, induce intracellular receptor-mediated tyrosine phosphorylation and the
expression of the c-fos mRNA transcript, and stimulate DNA synthesis and cell
proliferation. This family of proteins includes FGFs designated FGF-1 (acidic FGF
(aFGF)), FGF-2 (basic FGF (bFGF)), FGF-3 (int-2) (see, e.g., Moore et al., EMBO J.
5:919-924, 1986), FGF-4 (hst-1/K-FGF) (see, e.g., Sakamoto et al., Proc. Natl. Acad.
Sci. USA 83:1836-1840, 1986; U.S. Patent No. 5,126,323), FGF-5 (see, e.g., U.S. Patent
No. 5,155,217), FGF-6 (hst-2) (see, e.g., published European Application EP 0 488 196
A2; Iida et al., Oncogene 7:303-309, 1992), FGF-7 (keratinocyte growth factor) (KGF)
(see, e.g., Finch et al., Science 245:752-755, 1989; Rubin et al., Proc. Natl. Acad. Sci.
USA 86:802-806, 1989; and International Application WO 90/08771), FGF-8 (see, e.g.,
Tanaka et al., Proc. Natl. Acad. Sci. USA 89:8928-8932, 1992); and FGF-9 (see,
Miyamoto et al., Mol. Cell. Biol. 13:4251-4259, 1993).

DNA encoding FGF peptides and/or the amino acid sequences of FGFs 
are known to those of skill in the art. DNA encoding an FGF may be prepared 
synthetically based on a known amino acid or DNA sequence, isolated using methods 
known to those of skill in the art, or obtained from commercial or other sources. DNA 
encoding virtually all of the FGF family of peptides is known. For example, DNA 
encoding human FGF-1 (Jaye et al., Science 255:541-545, 1986; U.S. Patent 
No. 5,223,483), bovine FGF-2 (Abraham et al., Science 255:545-548, 1986; Esch et al., 
Proc. Natl. Acad. Sci. USA 52:6507-6511, 1985; and U.S. Patent No. 4,956,455), 
human FGF-2 (Abraham et al., EMBO J. 5:2523-2528, 1986; U.S. Patent 
No. 4,994,559; U.S. Patent No. 5,155,214; EP 470,1838; and Abraham et al., Quant. 
Biol. 57:657-668, 1986) and rat FGF-2 (see Shimasaki et al., Biochem. Biophys. Res. 
Comm., 1988, and Kurokawa et al., Nucleic Acids Res. 16:5201, 1988), FGF-3, FGF-6, 
FGF-7 and FGF-9 are known (see also U.S. Patent No. 5,155,214; U.S. Patent 
No. 4,956,455; U.S. Patent No. 5,026,839; U.S. Patent No. 4,994,559; 
EP 0,488,196 A2; the DNASTAR, EMBL or GenBank databases; and references discussed 
above and below). DNA encoding an FGF may also be produced from any of the preceding 
DNA fragments by substitution of degenerate codons. It is understood that once the 
complete amino acid sequence of a peptide, such as an FGF peptide, and the DNA 
fragment encoding such peptide are available to those of skill in this art, it is routine to 
substitute degenerate codons and produce any of the possible DNA fragments that 
encode such peptide. It is also generally possible to synthesize DNA encoding such 
peptide based on the amino acid sequence. 
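
The substitution of degenerate codons described above can be illustrated with a minimal sketch. It assumes the standard genetic code; the function names (translate, count_encodings, all_encodings) are hypothetical and introduced only for illustration. The sketch counts and enumerates the DNA fragments that encode a given short peptide.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Invert the table: amino acid -> list of synonymous (degenerate) codons.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def count_encodings(peptide):
    """Return the number of distinct DNA sequences that encode the peptide."""
    total = 1
    for aa in peptide:
        total *= len(SYNONYMS[aa])
    return total


def all_encodings(peptide):
    """Yield every DNA sequence encoding the peptide (practical for short peptides only)."""
    for codons in product(*(SYNONYMS[aa] for aa in peptide)):
        yield "".join(codons)
```

For example, count_encodings("ML") gives 6, because methionine has a single codon and leucine has six. The count grows multiplicatively with peptide length, so full enumeration is only practical for short peptides.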

Thus, as used herein, "FGF" refers to polypeptides having amino acid 
sequences of native FGF proteins, as well as modified sequences, having amino acid 
substitutions, deletions, insertions or additions in the native protein but retaining the 
ability to bind to FGF receptors and to be internalized. It is understood that differences 
in amino acid sequences can occur among FGFs of different species as well as among 
FGFs from individual organisms or species. 

Reference to FGFs is intended to encompass proteins isolated from 
natural sources as well as those made synthetically, as by recombinant means or 
possibly by chemical synthesis. FGF also encompasses muteins that possess the ability 
to bind to FGF-receptor expressing cells. Such muteins include, but are not limited to, 
those produced by replacing one or more of the cysteines with serine as described 
herein or that have any other amino acids deleted or replaced as long as the resulting 
protein has the ability to bind to FGF-receptor bearing cells and internalize the linked 
targeted agent. Typically, such muteins will have conservative amino acid changes, 
such as those set forth below in Table 1. DNA encoding such muteins will, unless 
modified by replacement of degenerate codons, hybridize under conditions of at least 
low stringency to native DNA sequence encoding the starting FGF. 

Acidic and basic FGF are about 55% identical at the amino acid level 
and are highly conserved among species. The other members of the FGF family have a 
high degree of amino acid sequence similarity and share common physical and biological 
properties with FGF-1 and FGF-2, including the ability to bind to one or more FGF 
receptors. Basic FGF, int-2, hst-1/K-FGF, FGF-5, hst-2/FGF-6 and FGF-8 may have 
oncogenic potential; bFGF is expressed in melanomas, int-2 is expressed in mammary 
tumor virus and hst-1/K-FGF is expressed in angiogenic tumors. Acidic FGF, bFGF, 
KGF and FGF-9 are expressed in normal cells and tissues. 

FGFs exhibit a mitogenic effect on a wide variety of mesenchymal, 
endocrine and neural cells and are also important in differentiation and development. 
Of particular interest is their stimulatory effect on collateral vascularization and 
angiogenesis. In some instances, FGF-induced mitogenic stimulation may be 
detrimental. For example, cell proliferation and angiogenesis are integral aspects of 
tumor growth. Members of the FGF family, including bFGF, are thought to play a 
pathophysiological role, for example, in tumor development, rheumatoid arthritis, 
proliferative diabetic retinopathies and other complications of diabetes. 

The effects of FGFs are mediated by high affinity receptor tyrosine 
kinases present on the cell surface of FGF-responsive cells (see, e.g., PCT WO 
91/00916, WO 90/05522, PCT WO 92/12948; Imamura et al., Biochem. Biophys. Res. 
Comm. 755:583-590, 1988; Huang et al., J. Biol. Chem. 267:9568-9571, 1986; Partanen 
et al., EMBO J. 10:1347, 1991; and Moscatelli, J. Cell. Physiol. 131:123, 1987). Lower 
affinity receptors also appear to play a role in mediating FGF activities. The high 
affinity receptor proteins are single chain polypeptides with molecular weights ranging 
from 110 to 150 kD, depending on cell type, and constitute a family of structurally 
related FGF receptors. Four FGF receptor genes have been identified, and three of 
these genes generate multiple mRNA transcripts via alternative splicing of the primary 
transcript. 

2. Vascular endothelial growth factors 

Vascular endothelial growth factors (VEGFs) were identified by their 
ability to directly stimulate endothelial cell growth, but do not appear to have mitogenic 
effects on other types of cells. VEGFs also cause a rapid and reversible increase in 
blood vessel permeability. The members of this family have been referred to variously 
as vascular endothelial growth factor (VEGF), vascular permeability factor (VPF) and 
vasculotropin (see, e.g., Plouet et al., EMBO J. 5:3801-3806, 1989). Herein, they are 
collectively referred to as VEGF. 

VEGF was originally isolated from a guinea pig hepatocarcinoma cell 
line, line 10 (see, e.g., U.S. Patent No. 4,456,550), and has subsequently been identified 
in humans and in normal cells. It is expressed during normal development and in 
certain normal adult organs. Purified VEGF is a basic, heparin-binding, homodimeric 
glycoprotein that is heat-stable, acid-stable and may be inactivated by reducing agents. 

DNA sequences encoding VEGF and methods to isolate these sequences 
may be found primarily in U.S. Patent No. 5,240,848, U.S. Patent No. 5,332,671, U.S. 
Patent No. 5,219,739, U.S. Patent No. 5,194,596, and Houch et al., Mol. Endocrin. 
5:180, 1991. As used herein, "DNA encoding a VEGF peptide or polypeptide" refers to 
any of the DNA fragments set forth herein as coding such peptides, to any such DNA 
fragments known to those of skill in the art, and to any DNA fragment that encodes a VEGF 
that binds to a VEGF receptor and is internalized thereby. VEGF DNA may be isolated 
from a human cell library, for example, using any of the preceding DNA fragments as a 
probe or any DNA fragment that encodes any of the VEGF peptides set forth in SEQ ID 
NOs. 1-4. It is understood that once the complete amino acid sequence of a peptide, 
such as a VEGF peptide, and the DNA fragment encoding such peptide are available to 
those of skill in this art, it is routine to substitute degenerate codons and produce any of 
the possible DNA fragments that encode such peptide. It is also generally possible to 
synthesize DNA encoding such peptide based on the amino acid sequence. 

VEGF family members arise from a single gene organized as eight exons 
and spanning approximately 14 kb in the human genome. Four molecular species of 
VEGF result from alternative splicing of mRNA and contain 121, 165, 189 and 206 
amino acids. The four species have similar biological activities, but differ markedly in 
their secretion patterns. The predominant isoform secreted by a variety of normal and 
transformed cells is VEGF165. Transcripts encoding VEGF121 and VEGF189 are 
detectable in most cells and tissues that express the VEGF gene. In contrast, VEGF206 is 
less abundant and has been identified only in a human fetal liver cDNA library. 
VEGF121 is a weakly acidic polypeptide that lacks the heparin binding domain and, 
consequently, does not bind to heparin. VEGF189 and VEGF206 are more basic than 
VEGF165 and bind to heparin with greater affinity. Although not every identified VEGF 
isoform binds heparin, all isoforms are considered to be heparin-binding growth factors 
within the context of this invention. 



The secreted isoforms, VEGF121 and VEGF165, are preferred VEGF 
proteins. The longer isoforms, VEGF189 and VEGF206, are almost completely bound to 
the extracellular matrix and need to be released by an agent, such as suramin, heparin or 
heparinase, or plasmin. Other preferred VEGF proteins contain various combinations 
of VEGF exons, such that the protein still binds VEGF receptor and is internalized. It is 
not necessary that a VEGF protein used in the context of this invention either retain any 
of its in vivo biological activities, such as stimulating endothelial cell growth, or bind 
heparin. It is only necessary that the VEGF protein or fragment thereof bind the VEGF 
receptor and be internalized into the cell bearing the receptor. However, it may be 
desirable in certain contexts for VEGF to manifest certain of its biological activities. 
For example, if VEGF is used as a carrier for DNA encoding a molecule useful in 
wound healing, it would be desirable that VEGF exhibit vessel permeability activity and 
promotion of fibroblast migration and angiogenesis. It will be apparent from the 
teachings provided within the subject application which of the activities of VEGF are 
desirable to maintain. 

VEGF promotes an array of responses in endothelium, including blood 
vessel hyperpermeability, endothelial cell growth, angiogenesis, and enhanced glucose 
transport. VEGF stimulates the growth of endothelial cells from a variety of sources 
(including brain capillaries, fetal and adult aortas, and umbilical veins) at low 
concentrations, but is reported to have no effect on the growth of vascular smooth 
muscle cells, adrenal cortex cells, keratinocytes, lens epithelial cells, or BHK-21 
fibroblasts. VEGF also is a potent polypeptide regulator of blood vessel function; it 
causes a rapid but transient increase in microvascular permeability without causing 
endothelial cell damage or mast cell degranulation, and its action is not blocked by 
antihistamines. VEGF has also been reported to induce monocyte migration and 
activation and has been implicated as a tumor angiogenesis factor in some human 
gliomas. In addition, VEGF is a chemoattractant for monocytes and has been shown to 
enhance the activity of the inflammatory mediator tumor necrosis factor (TNF). 

Quiescent and proliferating endothelial cells display high-affinity 
binding to VEGF, and endothelial cell responses to VEGF appear to be mediated by 
high affinity cell surface receptors (see, e.g., International Application WO 92/14748, 
which is based on U.S. Application Serial No. 08/657,236; de Vries et al., Science 
255:989-91, 1992; Terman et al., Biochem. Biophys. Res. Commun. 757:1579-1586, 
1992; Kendall et al., Proc. Natl. Acad. Sci. USA 90:10705-10709, 1993; and Peters et 
al., Proc. Natl. Acad. Sci. USA 90:8915-8919, 1993). Two tyrosine kinases have been 
identified as VEGF receptors. The first, known as fms-like tyrosine kinase or FLT, is a 
receptor tyrosine kinase that is specific for VEGF. In adult and embryonic tissues, 
expression of FLT mRNA is localized to the endothelium and to populations of cells 
that give rise to endothelium. The second receptor, KDR (human kinase insert 
domain-containing receptor), and its mouse homologue FLK-1, are closely related to FLT. The 
KDR/FLK-1 receptor is expressed in endothelium during the fetal growth stage, during 
earlier embryonic development, and in adult tissues. In addition, messenger RNAs 
encoding FLT and KDR have been identified in tumor blood vessels, specifically in 
endothelial cells of blood vessels supplying glioblastomas. Similarly, FLT and KDR 
mRNAs are upregulated in tumor blood vessels in invasive human colon 
adenocarcinoma, but not in the blood vessels of adjacent normal tissues. 

3. Heparin-binding epidermal growth factors 

Several new mitogens in the epidermal growth factor protein family have 
recently been identified that display the ability to bind the glycosaminoglycan heparin. 
Among these is the mitogen known as heparin-binding EGF-like growth factor 
(HBEGF), which elutes from heparin-Sepharose™ columns at about 1.0-1.2 M NaCl 
and which was first identified as a secreted product of cultured human monocytes, 
macrophages, and the macrophage-like U-937 cell line (Higashiyama et al., Science 
257:936-939, 1991; Besner et al., Cell Regul. 7:811-19, 1990). HBEGF has been 
shown to interact with the same high affinity receptors as EGF on bovine aortic smooth 
muscle cells and human A431 epidermoid carcinoma cells (Higashiyama et al., Science 
257:936-939, 1991). 

HBEGFs exhibit a mitogenic effect on a wide variety of cells including 
BALB/c 3T3 fibroblast cells and smooth muscle cells, but unlike VEGFs, are not 
mitogenic for endothelial cells (Higashiyama et al., Science 257:936-939, 1991). 



HBEGF also has a stimulatory effect on collateral vascularization and angiogenesis. 
Members of the HBEGF family are thought to play a pathophysiological role, for 
example, in a variety of tumors, such as bladder carcinomas, breast tumors and 
non-small cell lung tumors. Thus, these cell types are likely candidates for delivery of 
cytocide-encoding agents. 

HBEGF isolated from U-937 cells is heterogeneous in structure and 
contains at least 86 amino acids and two sites of O-linked glycosyl groups 
(Higashiyama et al., J. Biol. Chem. 267:6205-6212, 1992). The carboxyl-terminal half 
of the secreted HBEGF shares approximately 35% sequence identity with human EGF, 
including six cysteines spaced in the pattern characteristic of members of the EGF 
protein family. In contrast, the amino-terminal portion of the mature factor is 
characterized by stretches of hydrophilic residues and has no structural equivalent in 
EGF. Site-directed mutagenesis of HBEGF and studies with peptide fragments have 
indicated that the heparin-binding sequences of HBEGF reside primarily in a 21 amino 
acid stretch upstream of and slightly overlapping the EGF-like domain. 

The effects of HBEGFs are mediated by EGF receptor tyrosine kinases 
expressed on cell surfaces of HBEGF-responsive cells (see, e.g., U.S. Patent Nos. 
5,183,884 and 5,218,090; and Ullrich et al., Nature 309:418-425, 1984). The EGF 
receptor proteins, which are single chain polypeptides with molecular weights of about 170 kD, 
constitute a family of structurally related EGF receptors. Cells known to express the 
EGF receptors include smooth muscle cells, fibroblasts, keratinocytes, and numerous 
human cancer cell lines, such as: A431 (epidermoid); KB3-1 (epidermoid); COLO 
205 (colon); CRL 1739 (gastric); HEP G2 (hepatoma); LNCAP (prostate); MCF-7 
(breast); MDA-MB-468 (breast); NCI 417D (lung); MG63 (osteosarcoma); U-251 
(glioblastoma); D-54MB (glioma); and SW-13 (adrenal). 

For the purposes of this invention, HBEGF need only bind a specific 
HBEGF receptor and be internalized. Any member of the HBEGF family, whether or 
not it binds heparin, is useful within the context of this invention as long as it meets the 
requirements set forth above. Members of the HBEGF family are those that have 
sufficient nucleotide identity to hybridize under normal stringency conditions (typically 
greater than 75% nucleotide identity). Subfragments or subportions of a full-length 
HBEGF may also be desirable. One skilled in the art may find from the teachings 
provided within that certain biological activities are more or less desirable, depending 
upon the application. 
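
The 75% nucleotide identity figure mentioned above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (gaps written as "-") and treats percent identity as a simple column count. Actual hybridization behavior also depends on length, base composition and wash conditions, none of which are modeled here. The function names and the threshold parameter are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns; columns that are gaps in both are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # shared gap column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a, aligned_b, threshold=75.0):
    """True when identity is strictly greater than the threshold (75% by default)."""
    return percent_identity(aligned_a, aligned_b) > threshold
```

A pair of aligned sequences that differ at one position in eight scores 87.5% and passes the default threshold; a pair at exactly 75% does not, because the text specifies "greater than 75%".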

DNA encoding an HBEGF peptide or polypeptide refers to any DNA 
fragment encoding an HBEGF, as defined above. Exemplary DNA fragments include: 
any such DNA fragments known to those of skill in the art; any DNA fragment that 
encodes an HBEGF or fragment that binds to an HBEGF receptor and is internalized 
thereby; and any DNA fragment that encodes any of the HBEGF polypeptides set forth 
in SEQ ID NOs. 5-8. Such DNA sequences encoding HBEGF fragments are available 
from publicly accessible databases, such as EMBL and GenBank (Accession Nos. M93012 
(monkey) and M60278 (human)); from the plasmids pMTN-HBEGF (ATCC #40900) and 
pAX-HBEGF (ATCC #40899) (described in PCT Application WO 92/06705); and from 
Abraham et al., Biochem. Biophys. Res. Comm. 790:125-133, 1993. Unless modified 
by replacement of degenerate codons, DNA encoding HBEGF polypeptides will 
hybridize under conditions of at least low stringency to DNA encoding a native human 
HBEGF (e.g., SEQ ID NO. 9). In addition, any DNA fragment that may be produced 
from any of the preceding DNA fragments by substitution of degenerate codons is also 
contemplated for use herein. It is understood that since the complete amino acid 
sequences of HBEGF polypeptides, and DNA fragments encoding such peptides, are 
available to those of skill in this art, it is routine to substitute degenerate codons and 
produce any of the possible DNA fragments that encode such HBEGF polypeptides. It 
is also generally possible to synthesize DNA encoding such peptides based on the 
amino acid sequence. 


4. Other receptor-binding internalized ligands 

Other receptor-binding ligands may be used in the present invention. 
Any protein, polypeptide, analogue, or fragment that binds to a cell-surface receptor and 
is internalized may be used. In general, in addition to the specific heparin-binding 
growth factors discussed above, other growth factors and cytokines are especially well 
suited for use. These ligands may be produced by recombinant or other means in 
preparation for conjugation to the nucleic acid binding domain. The DNA sequences 
and methods to obtain the sequences of these receptor-binding internalized ligands are 
well known. For example, these ligands include CSF-1 (GenBank Accession Nos. 
M11038, M37435; Kawasaki et al., Science 230:291-296, 1985; Wong et al., Science 
235:1504-1508, 1987); GM-CSF (GenBank Accession No. X03021; Miyatake et al., 
EMBO J. 4:2561-2568, 1985); IFN-α (interferon) (GenBank Accession No. A02076; 
Patent No. WO 8502862-A, July 4, 1985); IFN-γ (GenBank Accession No. A02137; 
Patent No. WO 8502624-A, June 20, 1985); hepatocyte growth factor (GenBank 
Accession Nos. X16323, S80567, X57574; Nakamura et al., Nature 342:440-443, 1989; 
Nakamura et al., Prog. Growth Factor Res. 3:67-85, 1991; Miyazawa et al., Eur. J. 
Biochem. 197:15-22, 1991); IGF-Ia (insulin-like growth factor Ia) (GenBank Accession 
Nos. X56773, S61841; Sandberg-Nordqvist et al., Brain Res. Mol. Brain Res. 
72:275-277, 1992; Sandberg-Nordqvist et al., Cancer Res. 55:2475-2478, 1993); 
IGF-Ib (GenBank Accession Nos. X56774, S61860; Sandberg-Nordqvist et al., Brain 
Res. Mol. Brain Res. 72:275-277, 1992; Sandberg-Nordqvist, A.C., Cancer Res. 
55:2475-2478, 1993); IGF-I (GenBank Accession Nos. X03563, M29644; Dull et al., 
Nature 310:777-781, 1984; Rall et al., Meth. Enzymol. 146:239-248, 1987); IGF-II 
(GenBank Accession No. J03242; Shen et al., Proc. Natl. Acad. Sci. USA 
55:1947-1951, 1988); IL-1α (interleukin 1 alpha) (GenBank Accession Nos. X02531, M15329; 
March et al., Nature 315:641-647, 1985; Nishida et al., Biochem. Biophys. Res. 
Commun. 143:345-352, 1987); IL-1β (interleukin 1 beta) (GenBank Accession 
Nos. X02532, M15330, M15840; March et al., Nature 315:641-647, 1985; Nishida et al., 
Biochem. Biophys. Res. Commun. 143:345-352, 1987; Bensi et al., Gene 52:95-101, 
1987); IL-1 (GenBank Accession Nos. K02770, M54933, M38756; Auron et al., Proc. 
Natl. Acad. Sci. USA 81:1901-1911, 1984; Webb et al., Adv. Gene Technol. 22:339-340, 
1985); IL-2 (GenBank Accession Nos. A14844, A21785, X00695, X00200, X00201, 
X00202; Lupker et al., Patent No. EP 0307285-A, March 15, 1989; Perez et al., Patent 
No. EP 0416673-A, March 13, 1991; Holbrook et al., Nucleic Acids Res. 72:5005-5013, 
1984; Degrave et al., EMBO J. 2:2349-2353, 1983; Taniguchi et al., Nature 
302:305-310, 1983); IL-3 (GenBank Accession Nos. M14743, M20137; Yang et al., Cell 47:3-10, 
1986; Otsuka et al., J. Immunol. 140:2288-2295, 1988); IL-4 (GenBank Accession 
No. M13982; Yokota et al., Proc. Natl. Acad. Sci. USA 83:5894-5898, 1986); IL-5 
(GenBank Accession Nos. X04688, J03478; Azuma et al., Nucleic Acids Res. 
14:9149-9158, 1986; Tanabe et al., J. Biol. Chem. 262:16580-16584, 1987); IL-6 (GenBank 
Accession Nos. Y00081, X04602, M54894, M38669, M14584; Yasukawa et al., EMBO 
J. 6:2939-2945, 1987; Hirano et al., Nature 324:13-16, 1986; Wong et al., Behring Inst. 
Mitt. 53:40-47, 1988; May et al., Proc. Natl. Acad. Sci. USA 53:8957-8961, 1986); IL-7 
(GenBank Accession No. J04156; Goodwin et al., Proc. Natl. Acad. Sci. USA 
56:302-306, 1989); IL-8 (GenBank Accession No. Z11686; Kusner et al., Kidney Int. 
39:1240-1248, 1991); IL-10 (GenBank Accession Nos. X78437, M57627; Vieira et al., Proc. 
Natl. Acad. Sci. USA 55:1172-1176, 1991); IL-11 (GenBank Accession Nos. M57765, 
M37006; Paul et al., Proc. Natl. Acad. Sci. USA 57:7512-7516, 1990); IL-13 (GenBank 
Accession Nos. X69079, U10307; Minty et al., Nature 562:248-250, 1993; Smirnov, 
Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, June 2, 1994); TNF-α 
(tumor necrosis factor) (GenBank Accession No. A21522; Patent No. GB 2246569-A1, 
February 5, 1992); TNF-β (GenBank Accession No. D12614; Matsuyama et al., FEBS 
Letters 502:141-144, 1992). DNA sequences of other suitable receptor-binding 
internalized ligands may be obtained from the GenBank or EMBL DNA databases, 
reverse-synthesized from protein sequence obtained from the PIR database, or isolated by standard 
methods (Sambrook et al., supra) from cDNA or genomic libraries. 

5. Modifications of receptor-binding internalized ligands 

These ligands may be customized for a particular application. Means for 
modifying proteins are provided below. Briefly, additions, substitutions and deletions of 
amino acids may be produced by any commonly employed recombinant DNA method. 

An amino acid residue of FGF, VEGF, HBEGF or other 
receptor-binding internalized ligand is non-essential if the polypeptide that has been modified by 
deletion of the residue possesses substantially the same ability to bind to its receptor 
and internalize a linked agent as the unmodified polypeptide. 

As noted above, any polypeptide or peptide analogue, including 
peptidomimetics, that is reactive with an FGF receptor, a VEGF receptor, an HBEGF 
receptor, other growth factor receptor (e.g., PDGF receptor), cytokine receptor or other 
cell surface molecule, including members of the families and fragments thereof, or 
constrained analogs of such peptides that bind to the receptor and internalize a linked 
targeted agent, may be used in the context of this invention. Members of the FGF 
peptide family, including FGF-1 to FGF-9, are preferred. Modified peptides, especially 
those lacking proliferative function, and chimeric peptides, which retain the specific 
binding and internalizing activities, are also contemplated for use herein. 

A modification that is effected substantially near the N-terminus of a 
polypeptide is generally effected within the first about ten residues of the protein. Such 
modifications include the addition or deletion of residues, such as the addition of a 
cysteine to facilitate conjugation and form conjugates that contain a defined molar ratio, 
preferably a ratio of 1:1, of the polypeptides. 

DNA encoding one of the receptor-binding internalized ligands 
discussed above may be mutagenized using standard methodologies to delete or replace 
any cysteine residues that are responsible for aggregate formation. If necessary, the 
identity of cysteine residues that contribute to aggregate formation may be determined 
empirically, by deleting and/or replacing a cysteine residue and ascertaining whether the 
resulting protein aggregates in solutions containing physiologically acceptable buffers 
and salts. In addition, fragments of these receptor-binding internalized ligands may be 
constructed and used. The binding regions of many of these ligands have been 
delineated. Fragments may also be shown to bind and internalize by any one of the 
tests described herein. 

Modification of the polypeptide may be effected by any means known to 
those of skill in this art. The preferred methods herein rely on modification of DNA 
encoding the polypeptide and expression of the modified DNA. 

Merely by way of example, DNA encoding the FGF polypeptide may be 
isolated, synthesized or obtained from commercial sources. The amino acid sequences of 
FGF-1 through FGF-9 are set forth in SEQ ID NOs. 10-18; DNA sequences may be based on 
these amino acid sequences or may be obtained from public DNA databases and 
references (see, e.g., GenBank; see also U.S. Patent No. 4,956,455, U.S. Patent 
No. 5,126,323, U.S. Patent No. 5,155,217, U.S. Patent No. 4,868,113, PCT Application 
WO 90/08771, EP Application 0 488 196 A2, and Miyamoto et al., Mol. Cell. Biol. 
75:4251-4259, 1993). Expression of a recombinant FGF-2 protein in yeast and E. coli 
is described in Barr et al., J. Biol. Chem. 263:16471-16478, 1988, in PCT Application 
Serial No. PCT/US93/05702 and in United States Application Serial No. 07/901,718. 
Expression of recombinant FGF proteins may be performed as described herein or using 
methods known to those of skill in the art. 

Similarly, DNA encoding any of the other receptor-binding internalized 
ligands, including VEGF, HBEGF, IL-1, IL-2, and other cytokines and growth factors, 
may also be isolated, synthesized, or obtained from commercial sources. As noted 
above, DNA sequences are available in public databases, such as GenBank. Based on 
these sequences, oligonucleotide primers may be designed and used to amplify the gene 
from cDNA or mRNA by the polymerase chain reaction technique as one means of 
obtaining DNA. 
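
As a minimal sketch of the primer design step just described, the following derives a forward and a reverse primer from the two ends of a known coding sequence and checks in silico that they bracket the expected product. The function names and the default primer length are hypothetical, and practical design would also weigh melting temperature, secondary structure and specificity, which are not modeled.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def design_primers(target, primer_len=20):
    """Return (forward, reverse) primers matching the two ends of the target."""
    if len(target) < 2 * primer_len:
        raise ValueError("target is too short for two non-overlapping primers")
    forward = target[:primer_len]                       # same sense as the target
    reverse = reverse_complement(target[-primer_len:])  # anneals to the sense strand
    return forward, reverse


def expected_amplicon(template, forward, reverse):
    """Return the region of the template bracketed by the primers, or None."""
    start = template.find(forward)
    end = template.find(reverse_complement(reverse))
    if start < 0 or end < start:
        return None
    return template[start:end + len(reverse)]
```

Given a template in which the target is embedded in flanking sequence, expected_amplicon returns the target itself when both primers find their sites, and None when either site is absent.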

Mutations may be made by any method known to those of skill in the art, 
including site-specific or site-directed mutagenesis of DNA encoding the protein and 
the use of DNA amplification methods using primers to introduce and amplify 
alterations in the DNA template, such as PCR splicing by overlap extension (SOE). 
Site-directed mutagenesis is typically effected using a phage vector that has single- and 
double-stranded forms, such as M13 phage vectors, which are well-known and 
commercially available. Other suitable vectors that contain a single-stranded phage 
origin of replication may be used (see, e.g., Vieira et al., Meth. Enzymol. 153:3, 1987). In 
general, site-directed mutagenesis is performed by preparing a single-stranded vector 
that encodes the protein of interest (e.g., a member of the FGF family or a cytotoxic 
molecule, such as a saporin). An oligonucleotide primer that contains the desired 
mutation within a region of homology to the DNA in the single-stranded vector is 
annealed to the vector, followed by addition of a DNA polymerase, such as E. coli DNA 
polymerase I (Klenow fragment), which uses the double-stranded region as a primer to 
produce a heteroduplex in which one strand encodes the altered sequence and the other 
the original sequence. The heteroduplex is introduced into appropriate bacterial cells, 
and clones that include the desired mutation are selected. The resulting altered DNA 
molecules may be expressed recombinantly in appropriate host cells to produce the 
modified protein. 
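
The mutagenic oligonucleotide used in this procedure can be sketched as follows. This is an illustration under stated assumptions rather than a protocol: the function names, the flank length and the example sequence are hypothetical, and the oligonucleotide is reported in the sense orientation (in practice it would be synthesized complementary to whichever strand the single-stranded vector carries). The example replaces a cysteine codon (TGT) with a serine codon (TCT).

```python
def codon_swap_oligo(coding_dna, codon_index, new_codon, flank=9):
    """Return an oligo carrying new_codon flanked by unaltered template sequence."""
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three bases")
    start = 3 * codon_index  # zero-based codon index -> nucleotide offset
    if start < flank or start + 3 + flank > len(coding_dna):
        raise ValueError("not enough flanking sequence around the codon")
    upstream = coding_dna[start - flank:start]
    downstream = coding_dna[start + 3:start + 3 + flank]
    return upstream + new_codon + downstream


def count_mismatches(oligo, coding_dna, codon_index, flank=9):
    """Count positions where the oligo differs from the template it anneals to."""
    start = 3 * codon_index - flank
    window = coding_dna[start:start + len(oligo)]
    return sum(a != b for a, b in zip(oligo, window))
```

With the arbitrary template ATGGCTGGTTGTAAAGACCCGTAA, swapping codon index 3 (TGT) for TCT with a six-base flank yields GCTGGTTCTAAAGAC, which differs from the template at a single position, so the flanking homology is preserved for annealing.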

Suitable conservative substitutions of amino acids are well-known and 
may be made generally without altering the biological activity of the resulting molecule. 
For example, such substitutions may be made in accordance with those set forth in 
TABLE 1 as follows: 

TABLE 1 

Original residue      Conservative substitution 
Ala (A)               Gly; Ser 
Arg (R)               Lys 
Asn (N)               Gln; His 
Cys (C)               Ser 
Gln (Q)               Asn 
Glu (E)               Asp 
Gly (G)               Ala; Pro 
His (H)               Asn; Gln 
Ile (I)               Leu; Val 
Leu (L)               Ile; Val 
Lys (K)               Arg; Gln; Glu 
Met (M)               Leu; Tyr; Ile 
Phe (F)               Met; Leu; Tyr 
Ser (S)               Thr 
Thr (T)               Ser 
Trp (W)               Tyr 
Tyr (Y)               Trp; Phe 
Val (V)               Ile; Leu 


Other similarly conservative substitutions may be made. If necessary, such 
substitutions may be determined empirically merely by testing the resulting modified 
protein for the ability to bind to and internalize upon binding to the appropriate 
receptors. Those that retain this ability are suitable for use in the conjugates and 
methods herein. In addition, muteins of the FGFs are known to those of skill in the art 
(see, e.g., U.S. Patent No. 5,175,147; PCT Application No. WO 89/00198, U.S. Serial 
No. 07/070,797; PCT Application No. WO 91/15229; and U.S. Serial No. 07/505,124). 
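
As an illustration of how Table 1 might be applied when screening candidate muteins, the sketch below encodes the table as a lookup and sorts the differences between a native sequence and a mutein into those listed in Table 1 and all others. The dictionary reflects a reading of Table 1 in one-letter codes; the function name and the example sequences are hypothetical. A listed substitution is only a starting point, since the text requires that binding and internalization be confirmed empirically.

```python
# Table 1 in one-letter codes: original residue -> conservative substitutions.
CONSERVATIVE = {
    "A": {"G", "S"}, "R": {"K"}, "N": {"Q", "H"}, "C": {"S"}, "Q": {"N"},
    "E": {"D"}, "G": {"A", "P"}, "H": {"N", "Q"}, "I": {"L", "V"},
    "L": {"I", "V"}, "K": {"R", "Q", "E"}, "M": {"L", "Y", "I"},
    "F": {"M", "L", "Y"}, "S": {"T"}, "T": {"S"}, "W": {"Y"},
    "Y": {"W", "F"}, "V": {"I", "L"},
}


def classify_substitutions(native, mutein):
    """Return (conservative, other) lists of substitutions such as 'C2S'."""
    if len(native) != len(mutein):
        raise ValueError("sequences must be the same length (substitutions only)")
    conservative, other = [], []
    for pos, (a, b) in enumerate(zip(native, mutein), start=1):
        if a == b:
            continue
        label = f"{a}{pos}{b}"  # original residue, 1-based position, new residue
        if b in CONSERVATIVE.get(a, set()):
            conservative.append(label)
        else:
            other.append(label)
    return conservative, other
```

For the toy sequences MCKAL and MSKGL, the function reports C2S and A4G as conservative and nothing else, which matches the cysteine-to-serine replacement discussed for FGF muteins.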


B. Nucleic acid binding domains 

As previously noted, nucleic acid binding domains (NABD) interact with 
the target nucleic acid either in a sequence-specific manner or a sequence-nonspecific 
manner. When the interaction is non-specific, the nucleic acid binding domain binds 
nucleic acid regardless of the sequence. For example, poly-L-lysine is a basic 
polypeptide that binds to oppositely charged DNA. Other highly basic proteins or 
polycationic compounds, such as histones, protamines, and spermidine, also bind to 
nucleic acids in a nonspecific manner. 

Many proteins have been identified that bind specific sequences of DNA. 
These proteins are responsible for genome replication, transcription and repair of 
damaged DNA. The transcription factors regulate gene expression and are a diverse 
group of proteins. These factors are especially well suited for purposes of the subject 
invention because of their sequence-specific recognition. Host transcription factors 
have been grouped into seven well-established classes based upon the structural motif 
used for recognition. The major families include helix-turn-helix (HTH) proteins, 
homeodomains, zinc finger proteins, steroid receptors, leucine zipper proteins, the 
helix-loop-helix (HLH) proteins, and β-sheets. Other classes or subclasses may 
eventually be delineated as more factors are discovered and defined. Proteins from 
those classes or proteins that do not fit within one of these classes but bind nucleic acid 
in a sequence-specific manner, such as SV40 T antigen and p53, may also be used. 

These families of transcription factors are generally well-known (see 
GenBank; Pabo and Sauer, Ann. Rev. Biochem. 61:1053-1095, 1992; and references 
below). Many of these factors are cloned and the precise DNA-binding region 
delineated in certain instances. When the sequence of the DNA-binding domain is 
known, a gene encoding it may be synthesized if the region is short. Alternatively, the 
genes may be cloned from the host genomic libraries or from cDNA libraries using 
oligonucleotides as probes or from genomic DNA or cDNA by polymerase chain 
reaction methods. Such methods may be found in Sambrook et al., supra. 

Helix-turn-helix proteins include the well studied λ Cro protein, λ cI, and 
E. coli CAP proteins (see Steitz et al., Proc. Natl. Acad. Sci. USA 79:3097-3100, 1982; 
Ohlendorf et al., J. Mol. Biol. 169:151-169, 1983). In addition, the lac repressor 
(Kaptein et al., J. Mol. Biol. 752:179-182, 1985) and Trp repressor (Scheritz et al., 
Nature 317:782-786, 1985) belong to this family. Members of the homeodomain 
family include the Drosophila protein Antennapedia (Qian et al., Cell 59:573-580, 
1989) and yeast MATα2 (Wolberger et al., Cell 67:517-528, 1991). Zinc finger 
proteins include TFIIIA (Miller et al., EMBO J. 4:1609-1614, 1985), Sp-1, zif 268, and 
many others (see generally Krizek et al., J. Am. Chem. Soc. 113:4518-4523, 1991). 
Steroid receptor proteins include receptors for steroid hormones, retinoids, vitamin D, 
thyroid hormones, as well as other compounds. Specific examples include retinoic acid, 
knirps, progesterone, androgen, glucocorticoid and estrogen receptor proteins. The 
leucine zipper family was defined by a heptad repeat of leucines over a region of 30 to 
40 residues. Specific members of this family include C/EBP, c-fos, c-jun, GCN4, sis-A, 
and CREB (see generally O'Shea et al., Science 254:539-544, 1991). The 
helix-loop-helix (HLH) family of proteins appears to have some similarities to the leucine zipper 
family. Well-known members of this family include myoD (Weintraub et al., Science 
257:761-766, 1991); c-myc; and AP-2 (Williams and Tjian, Science 257:1067-1071, 
1991). The β-sheet family uses an antiparallel β-sheet for DNA binding, rather than the 
more common α-helix. The family contains the MetJ (Phillips, Curr. Opin. Struc. Biol. 
7:89-98, 1991), Arc (Breg et al., Nature 346:586-589, 1990) and Mnt repressors. In 
addition, other motifs are used for DNA binding, such as the cysteine-rich motif in yeast 
GAL4 repressor, and the GATA factor. Viruses also contain gene products that bind 
specific sequences. One of the most-studied such viral genes is the rev gene from HIV. 
The rev gene product binds a sequence called RRE (rev responsive element) found in 
the env gene. Other proteins or peptides that bind DNA may be discovered on the basis 
of sequence similarity to the known classes or functionally by selection. 
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Several techniques may be used to select other nucleic acid binding 
domains (see U.S. Patent No. 5,270,170; PCT Application WO 93/14108; and U.S. 
Patent No. 5,223,409). One of these techniques is phage display. (See, for example, 
U.S. Patent No. 5,223,409.) In this method, DNA sequences are inserted into the 
5 gene III or gene VIII gene of a filamentous phage, such as Ml 3. Several vectors with 
multicloning sites have been developed for insertion (McLafferty etal., Gene 128:29- 
36, 1993; Scott and Smith, Science 2^P:386-390, 1990; Smith and Scott, Methods 
Enzymol 277:228-257, 1993). The inserted DNA sequences may be randomly 
generated or variants of a known DNA-binding domain. Generally, the inserts encode 

10 from 6 to 20 amino acids. The peptide encoded by the inserted sequence is displayed 
on the surface of the bacteriophage. Bacteriophage expressing a desired nucleic acid- 
binding domain are selected for by binding to the cytocide-encoding agent. This target 
molecule may be single stranded or double stranded DNA or RNA. When the cytocide- 
encoding agent to be delivered is single-stranded, such as RNA, the appropriate target is 

15 singlerstranded. When the rnolecule to be delivered is double-stranded, the target 
molecule is preferably double-stranded. Preferably, the entire coding region of the 
cytocide-encoding agent is used as the target. In addition, elements necessary for 
transcription that are included for in vivo or in vitro delivery may be present in the 
target DNA molecule. Bacteriophage that bind the target are recovered and propagated. 

20 Subsequent rounds of selection may be performed. The final selected bacteriophage are 
propagated and the DNA sequence of the insert is determined. Once the predicted 
amino acid sequence of the binding peptide is known, sufficient peptide for use herein 
as an nucleic acid binding domain may be made either by recombinant means or 
synthetically. Recombinant means is used when the receptor-binding internalized 

25 ligand/nucleic acid binding domain is produced as a fusion protein. In addition, the 
peptide may be generated as a tandem array of two or more peptides, in order to 
maximize affinity or binding of multiple DNA molecules to a single polypeptide. 
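
To give a sense of scale for the 6 to 20 amino acid inserts mentioned above, the theoretical number of distinct peptides in a fully random library can be computed directly. The following is a minimal sketch, assuming each position may be any of the 20 standard amino acids; the function name `library_size` is hypothetical and is introduced only for illustration.

```python
# Theoretical diversity of a random peptide library displayed on phage.
# Assumption (not from the text): every position is drawn from the
# 20 standard amino acids, so a length-n insert has 20**n variants.
AMINO_ACID_COUNT = 20


def library_size(insert_length: int) -> int:
    """Return the number of distinct peptides of the given length."""
    if insert_length < 1:
        raise ValueError("insert length must be a positive integer")
    return AMINO_ACID_COUNT ** insert_length


if __name__ == "__main__":
    # The text gives 6 to 20 residues as the usual insert range.
    for n in (6, 12, 20):
        print(f"{n:2d} residues: {library_size(n):.2e} possible peptides")
```

In practice a physical library samples only a fraction of this space for the longer inserts, which is one reason repeated rounds of selection and variant generation are useful.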

As an example of the phage display selection technique, a DNA-binding
domain/peptide that recognizes the coding region of saporin is isolated. Briefly, DNA
fragments encoding saporin may be isolated from a plasmid containing these sequences.
The plasmid FPFS1 contains the entire coding region of saporin. Digestion of the
plasmid with NcoI and EcoRI restriction enzymes liberates the saporin-specific
sequence as a single fragment of approximately 780 bp. This fragment may be purified
by any one of a number of methods, such as agarose gel electrophoresis and subsequent
elution from the gel. The saporin fragment is fixed to a solid support, such as in the
wells of a 96-well plate. If the double-stranded fragment does not bind well to the plate,
a coating, such as a positively charged molecule, may be used to promote DNA
adherence. The phage library is added to the wells and an incubation period allows for
binding of the phage to the DNA. Unbound phage are removed by a wash, typically
containing 10 mM Tris, 1 mM EDTA, and without salt or with a low salt concentration.
Bound phage are eluted starting at a 0.1 M NaCl containing buffer. The NaCl
concentration is increased in a step-wise fashion until all the phage are eluted.
Typically, phage binding with higher affinity will only be released by higher salt
concentrations.
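
The step-wise elution described above can be laid out as a simple schedule of NaCl concentrations. The sketch below is illustrative only: the text specifies the 0.1 M starting point, while the step size and the upper limit used here are assumptions, and `salt_steps` is a hypothetical helper name.

```python
# Step-wise NaCl elution schedule, expressed in mM to avoid
# floating-point rounding. Only the 100 mM (0.1 M) starting point
# comes from the text; step size and upper limit are assumed values.
def salt_steps(start_mM: int = 100, step_mM: int = 100, stop_mM: int = 1000) -> list:
    """Return the NaCl concentrations (mM) used for successive elutions."""
    if step_mM <= 0:
        raise ValueError("step must be positive")
    if stop_mM < start_mM:
        raise ValueError("stop must not be below start")
    return list(range(start_mM, stop_mM + 1, step_mM))


if __name__ == "__main__":
    for fraction, conc in enumerate(salt_steps(), start=1):
        # Later fractions are expected to hold the tighter-binding phage.
        print(f"fraction {fraction}: elute with {conc} mM NaCl")
```

Phage collected in the later, higher-salt fractions would be the candidates carried forward into further rounds of selection.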

Eluted phage are propagated in the bacterial host. Further rounds of
selection may be performed to select for a few phage binding with high affinity. The
DNA sequence of the insert in the binding phage is then determined. In addition,
peptides having a higher affinity may be isolated by making variants of the insert
sequence and subjecting these variants to further rounds of selection.






C. Cytocide-encoding agents

A cytocide-encoding agent is a nucleic acid molecule (DNA or RNA)
that, upon internalization by a cell, and subsequent transcription (if DNA) and/or
translation into a cytocidal agent, is cytotoxic to a cell or inhibits cell growth by
inhibiting protein synthesis.

Cytocides include saporin, the ricins, abrin and other ribosome
inactivating proteins, Pseudomonas exotoxin, diphtheria toxin, angiogenin, tritin,
dianthins 32 and 30, momordin, pokeweed antiviral protein, mirabilis antiviral protein,
bryodin, and shiga exotoxin, as well as other cytocides that are known to
those of skill in the art. Alternatively, cytocide gene products may be noncytotoxic but
activate a compound, which is endogenously produced or exogenously applied, from a
nontoxic form to a toxic product that inhibits protein synthesis.

Especially of interest are DNA molecules that encode an enzyme that
results in cell death or renders a cell susceptible to cell death upon the addition of
another product. For example, saporin is an enzyme that cleaves rRNA and inhibits
protein synthesis. Other enzymes that inhibit protein synthesis are especially well
suited for use in the present invention. In addition, enzymes may be used where the
enzyme activates a compound with little or no cytotoxicity into a toxic product that
inhibits protein synthesis.


1. Ribosome inactivating proteins 

Ribosome-inactivating proteins (RIPs), which include ricin, abrin, and
saporin, are plant proteins that catalytically inactivate eukaryotic ribosomes.
Ribosome-inactivating proteins inactivate ribosomes by interfering with the protein
elongation step of protein synthesis. For example, the ribosome-inactivating protein
saporin (hereinafter also referred to as SAP) has been shown to inactivate 60S ribosomes
by cleavage of the N-glycosidic bond of the adenine at position 4324 in the rat 28S
ribosomal RNA (rRNA). The particular region in which A4324 is located in the rRNA is
highly conserved among prokaryotes and eukaryotes; A4324 in 28S rRNA corresponds to
A2660 in E. coli 23S rRNA. Several of the ribosome inactivating proteins also appear to
interfere with protein synthesis in prokaryotes, such as E. coli.

Saporin is preferred as a cytocide, but other suitable ribosome
inactivating proteins (RIPs) and toxins may be used. Other suitable RIPs include, but
are not limited to, ricin, ricin A chain, maize ribosome inactivating protein, gelonin,
diphtheria toxin, diphtheria toxin A chain, trichosanthin, tritin, pokeweed antiviral
protein (PAP), mirabilis antiviral protein (MAP), dianthins 32 and 30, abrin, momordin,
bryodin, shiga (see, e.g., WO 93/24620) and others (see, e.g., Barbieri et al., Cancer
Surveys 1:489-520, 1982, and European patent application No. 0466 222, incorporated
herein by reference, which provide lists of numerous ribosome inactivating proteins and
their sources; see also U.S. Patent No. 5,248,608 to Walsh et al.). Some ribosome
inactivating proteins, such as abrin and ricin, contain two constituent chains: a
cell-binding chain that mediates binding to cell surface receptors and internalization of
the molecule, and a chain responsible for toxicity. Single chain ribosome inactivating
proteins (type I RIPs), such as the saporins, do not have a cell-binding chain. As a
result, unless internalized, they are substantially less toxic to whole cells than the
ribosome inactivating proteins that have two chains.

Several structurally related ribosome inactivating proteins have been
isolated from seeds and leaves of the plant Saponaria officinalis (soapwort) (GB Patent
2,194,241 B; GB Patent 2,216,891; EP Patent 89306016). Saporin proteins for use in
this invention have amino acid sequences found in the natural plant host Saponaria
officinalis or modified sequences, having amino acid substitutions, deletions, insertions
or additions, but which still express substantial ribosome inactivating activity. Purified
preparations of saporin are frequently observed to include several molecular isoforms of
the protein. It is understood that differences in amino acid sequences can occur in
saporin from different species as well as between saporin molecules from individual
organisms of the same species. Among these, SO-6 is the most active and abundant,
representing 7% of total seed proteins. Saporin is very stable, has a high isoelectric
point, does not contain carbohydrates, and is resistant to denaturing agents, such as
sodium dodecyl sulfate (SDS), and a variety of proteases. The amino acid sequences of
several saporin-6 isoforms from seeds are known, and there appear to be families of
saporin ribosome inactivating proteins differing in few amino acid residues. Any of
these saporin proteins or modified proteins that are cytotoxic may be used in the present
invention.






a. Isolation of DNA encoding saporin

Some of the DNA molecules provided herein encode saporin that has
substantially the same amino acid sequence and ribosome inactivating activity as that of
saporin-6 (SO-6), including any of four isoforms, which have heterogeneity at amino
acid positions 48 and 91 (see, e.g., Maras et al., Biochem. Internat. 21:631-638, 1990,
and Barra et al., Biotechnol. Appl. Biochem. 13:48-53, 1991; GB Patent 2,216,891 B
and EP Patent 89306106; and SEQ ID NOs. 19-23). Other suitable saporin
polypeptides include other members of the multi-gene family coding for isoforms of
saporin-type ribosome inactivating proteins including SO-1 and SO-3
(Fordham-Skelton et al., Mol. Gen. Genet. 221:134-138, 1990), SO-2 (see, e.g., U.S.
Application Serial No. 07/885,242; GB 2,216,891; see also Fordham-Skelton et al., Mol.
Gen. Genet. 229:460-466, 1991), SO-4 (see, e.g., GB 2,194,241 B; see also Lappi et al.,
Biochem. Biophys. Res. Commun. 129:934-942, 1985) and SO-5 (see, e.g., GB
2,194,241 B; see also Montecucchi et al., Int. J. Peptide Protein Res. 33:263-267,
1989).

The saporin polypeptides for use in this invention include any of the
isoforms of saporin that may be isolated from Saponaria officinalis or related species or
modified forms that retain cytotoxic activity. In particular, such modified saporin may
be produced by modifying the DNA encoding the protein (see, e.g., International PCT
Application Serial No. PCT/US93/05702, and United States Application Serial
No. 07/901,718; see also U.S. Patent Application No. 07/885,242, and Italian Patent
No. 1,231,914) by altering one or more amino acids or deleting or inserting one or more
amino acids. Any such protein, or portion thereof, that exhibits cytotoxicity in standard
in vitro or in vivo assays within at least about an order of magnitude of the saporin
conjugates described herein is contemplated for use herein.

Preferably, the saporin DNA sequence contains mammalian-preferred
codons (SEQ ID NO. 79). Preferred codon usage, as exemplified in Current Protocols
in Molecular Biology, infra, and Zhang et al. (Gene 105:61, 1991) for mammals, yeast,
Drosophila, E. coli, and primates, is established for the saporin sequence.
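
As an illustration of what designing a coding sequence with mammalian-preferred codons involves, the sketch below back-translates a protein sequence using one frequently used codon per amino acid. The codon table is an assumption based on commonly reported human codon usage; it is not taken from SEQ ID NO. 79 or from the references cited above, and the name `back_translate` is hypothetical.

```python
# Back-translate a protein into DNA using one "preferred" codon per
# amino acid. The table below is an illustrative assumption reflecting
# commonly reported human codon usage, not the patent's own sequence.
PREFERRED_CODON = {
    "A": "GCC", "R": "AGA", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}
STOP_CODON = "TGA"


def back_translate(protein: str, add_stop: bool = False) -> str:
    """Return a DNA sequence encoding `protein` (one-letter codes)."""
    try:
        dna = "".join(PREFERRED_CODON[aa] for aa in protein.upper())
    except KeyError as exc:
        raise ValueError(f"unknown amino acid code: {exc.args[0]}") from None
    return dna + (STOP_CODON if add_stop else "")


if __name__ == "__main__":
    # A short made-up peptide, used only to show the call.
    print(back_translate("MKV", add_stop=True))
```

A real design would usually also avoid unwanted restriction sites and extreme GC content, which a one-codon-per-residue table does not address.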

The cytocide-encoding agent, such as the saporin DNA sequence, is
introduced into a plasmid in operative linkage to an appropriate promoter for expression
of polypeptides in the organism. The presently preferred saporin proteins are SO-6 and
SO-4. The DNA can optionally include sequences, such as origins of replication, that
allow for the extrachromosomal maintenance of the saporin-containing plasmid, or can
be designed to integrate into the genome of the host (as an alternative means to ensure
stable maintenance in the host).

b. Nucleic acids encoding other ribosome inactivating proteins and cytocides

In addition to saporin discussed above, other cytocides that inhibit
protein synthesis are useful in the present invention. The gene sequences for these
cytocides may be isolated by standard methods, such as PCR, probe hybridization of
genomic or cDNA libraries, antibody screenings of expression libraries, or clones may
be obtained from commercial or other sources. The DNA sequences of many of these
cytocides are well known, including ricin A chain (GenBank Accession No. X02388);
maize ribosome inactivating protein (GenBank Accession No. L26305); gelonin
(GenBank Accession No. L12243; PCT Application WO 92/03155; U.S. Patent
No. 5,376,546); diphtheria toxin (GenBank Accession No. K01722); trichosanthin
(GenBank Accession No. M34858); tritin (GenBank Accession No. D13795);
pokeweed antiviral protein (GenBank Accession No. X78628); mirabilis antiviral
protein (GenBank Accession No. D90347); dianthin 30 (GenBank Accession
No. X59260); abrin (GenBank Accession No. X55667); shiga (GenBank Accession
No. M19437) and Pseudomonas exotoxin (GenBank Accession Nos. K01397,
M23348). When DNA sequences or amino acid sequences are known, DNA molecules
encoding these proteins may be synthesized, and preferably contain
mammalian-preferred codons.


D. Prodrug-encoding agent

A nucleic acid molecule encoding a prodrug may alternatively be used
within the context of the present invention. Prodrugs are inactive in the host cell until
either a substrate is provided or an activating molecule is provided. Most typically, a
prodrug activates a compound with little or no cytotoxicity into a toxic product. Two of
the more often used prodrug molecules, both of which may be used in the present
invention, are HSV thymidine kinase and E. coli cytosine deaminase.

Briefly, a wide variety of gene products which either directly or
indirectly activate a compound with little or no cytotoxicity into a toxic product may be
utilized within the context of the present invention. Representative examples of such
gene products include HSVTK (herpes simplex virus thymidine kinase) and VZVTK
(varicella zoster virus thymidine kinase), which selectively phosphorylate certain purine
arabinosides and substituted pyrimidine compounds. Phosphorylation converts these
compounds to metabolites that are cytotoxic or cytostatic. For example, exposure of the
drugs ganciclovir, acyclovir, or any of their analogues (e.g., FIAU, FIAC, DHPG) to
cells expressing HSVTK allows conversion of the drug into its corresponding active
nucleotide triphosphate form.

Other gene products that may be utilized within the context of the
present invention include E. coli guanine phosphoribosyl transferase, which converts
thioxanthine into toxic thioxanthine monophosphate (Besnard et al., Mol. Cell. Biol.
7:4139-4141, 1987); alkaline phosphatase, which converts inactive phosphorylated
compounds such as mitomycin phosphate and doxorubicin-phosphate to toxic
dephosphorylated compounds; fungal (e.g., Fusarium oxysporum) or bacterial cytosine
deaminase, which converts 5-fluorocytosine to the toxic compound 5-fluorouracil
(Mullen, PNAS 89:33, 1992); carboxypeptidase G2, which cleaves glutamic acid from
para-N-bis (2-chloroethyl) aminobenzoyl glutamic acid, thereby creating a toxic
benzoic acid mustard; and Penicillin-V amidase, which converts phenoxyacetamide
derivatives of doxorubicin and melphalan to toxic compounds (see generally, Vrudhula
et al., J. of Med. Chem. 36(7):919-923, 1993; Kern et al., Canc. Immun. Immunother.
31(4):202-206, 1990). Moreover, a wide variety of Herpesviridae thymidine kinases,
including both primate and non-primate herpesviruses, are suitable. Such herpesviruses
include Herpes Simplex Virus Type 1 (McKnight et al., Nuc. Acids Res. 8:5949-5964,
1980), Herpes Simplex Virus Type 2 (Swain and Galloway, J. Virol. 46:1045-1050,
1983), Varicella Zoster Virus (Davison and Scott, J. Gen. Virol. 67:1759-1816, 1986),
marmoset herpesvirus (Otsuka and Kit, Virology 135:316-330, 1984), feline herpesvirus
type 1 (Nunberg et al., J. Virol. 63:3240-3249, 1989), pseudorabies virus (Kit and Kit,
U.S. Patent No. 4,514,497, 1985), equine herpesvirus type 1 (Robertson and Whalley,
Nuc. Acids Res. 16:11303-11317, 1988), bovine herpesvirus type 1 (Mittal and Field, J.
Virol. 70:2901-2918, 1989), turkey herpesvirus (Martin et al., J. Virol. 63:2847-2852,
1989), Marek's disease virus (Scott et al., J. Gen. Virol. 70:3055-3065, 1989),
herpesvirus saimiri (Honess et al., J. Gen. Virol. 70:3003-3013, 1989) and Epstein-Barr
virus (Baer et al., Nature (London) 310:207-211, 1984). Such herpesviruses may be
readily obtained from commercial sources such as the American Type Culture
Collection ("ATCC", Rockville, Maryland).

Furthermore, as indicated above, a wide variety of inactive precursors
may be converted into active inhibitors. For example, thymidine kinase can
phosphorylate nucleosides (e.g., dT) and nucleoside analogues such as ganciclovir
(9-{[2-hydroxy-1-(hydroxymethyl)ethoxy]methyl}guanosine), famciclovir, buciclovir,
penciclovir, valciclovir, acyclovir (9-[(2-hydroxyethoxy)methyl]guanosine),
trifluorothymidine, 1-[2-deoxy, 2-fluoro, beta-D-arabinofuranosyl]-5-iodouracil, ara-A
(adenosine arabinoside, vidarabine), 1-beta-D-arabinofuranosyl thymine,
5-ethyl-2'-deoxyuridine, 5-iodo-5'-amino-2,5'-dideoxyuridine, idoxuridine
(5-iodo-2'-deoxyuridine), AZT (3' azido-3' thymidine), ddC (dideoxycytidine), AIU
(5-iodo-5' amino 2', 5'-dideoxyuridine) and AraC (cytidine arabinoside).

E. Other nucleic acid molecules

The conjugates provided herein may also be used to deliver other types
of nucleic acids to targeted cells. Such other nucleic acids include antisense RNA,
antisense DNA, ribozymes, triplex-forming oligonucleotides, and oligonucleotides that
bind proteins. The nucleic acids can also include RNA trafficking signals, such as viral
packaging sequences (see, e.g., Sullenger et al. (1994) Science 262:1566-1569). The
nucleic acids also include DNA molecules that encode proteins that replace defective
genes, such as the gene associated with cystic fibrosis (see, e.g., PCT Application WO
93/03709, U.S. Application Serial No. 07/745,900; and Riordan et al. (1989) Science
245:1066-1073). Other DNA molecules may encode tumor-specific cytotoxic
molecules, such as tumor necrosis factor, viral antigens and other proteins to render a
cell susceptible to anti-cancer agents.

Nucleic acids and oligonucleotides for use as described herein can be
synthesized by any method known to those of skill in this art (see, e.g., WO 93/01286,
U.S. Application Serial No. 07/723,454; U.S. Patent No. 5,218,088; U.S. Patent No.
5,175,269; U.S. Patent No. 5,109,124). Identification of oligonucleotides and
ribozymes for use as antisense agents and DNA encoding genes for targeted delivery for
genetic therapy involve methods well known in the art. For example, the desirable
properties, lengths and other characteristics of such oligonucleotides are well known.
Antisense oligonucleotides are typically designed to resist degradation by endogenous
nucleolytic enzymes and include, but are not limited to: phosphorothioate,
methylphosphonate, sulfone, sulfate, ketyl, phosphorodithioate, phosphoramidate,
phosphate esters, and other such linkages (see, e.g., Agrawal et al., Tetrahedron Lett.
28:3539-3542 (1987); Miller et al., J. Am. Chem. Soc. 93:6657-6665 (1971); Stec et al.,
Tetrahedron Lett. 26:2191-2194 (1985); Moody et al., Nucl. Acids Res. 17:4769-4782
(1989); Uznanski et al., Nucl. Acids Res. (1989); Letsinger et al., Tetrahedron
40:137-143 (1984); Eckstein, Annu. Rev. Biochem. 54:367-402 (1985); Eckstein, Trends
Biol. Sci. 14:97-100 (1989); Stein In: Oligodeoxynucleotides. Antisense Inhibitors of
Gene Expression, Cohen, Ed., Macmillan Press, London, pp. 97-117 (1989); Jager et al.,
Biochemistry 27:7237-7246 (1988)).

Antisense nucleotides are oligonucleotides that bind in a
sequence-specific manner to nucleic acids, such as mRNA or DNA. When bound to mRNA
that has complementary sequences, antisense prevents translation of the mRNA (see, e.g.,
U.S. Patent No. 5,168,053 to Altman et al.; U.S. Patent No. 5,190,931 to Inouye; U.S.
Patent No. 5,135,917 to Burch; U.S. Patent No. 5,087,617 to Smith; and Clusel et al.
(1993) Nucl. Acids Res. 21:3405-3411, which describes dumbbell antisense
oligonucleotides). Triplex molecules refer to single DNA strands that bind duplex
DNA forming a colinear triplex molecule and thereby prevent transcription (see, e.g.,
U.S. Patent No. 5,176,996 to Hogan et al., which describes methods for making
synthetic oligonucleotides that bind to target sites on duplex DNA).
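
Because an antisense oligonucleotide binds its target by base complementarity, its sequence can be derived from the target mRNA by taking the reverse complement. The following is a minimal sketch of that derivation for a DNA antisense oligonucleotide; the function name `antisense_dna` and the example sequence are hypothetical and serve only as illustration.

```python
# Derive a DNA antisense oligonucleotide from an mRNA target region.
# The antisense strand is the reverse complement of the target, so it
# pairs antiparallel with the mRNA (A-U/T and G-C base pairing).
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(mrna: str) -> str:
    """Return the 5'->3' DNA oligonucleotide complementary to `mrna`."""
    try:
        return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(mrna.upper()))
    except KeyError as exc:
        raise ValueError(f"not an RNA base: {exc.args[0]}") from None


if __name__ == "__main__":
    target = "AUGGCC"  # made-up 5'->3' mRNA fragment
    print(antisense_dna(target))
```

Backbone modifications such as the phosphorothioate linkages mentioned earlier change the chemistry of the oligonucleotide but not the base sequence computed here.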

Particularly useful antisense nucleotides and triplex molecules are
molecules that are complementary or bind to the sense strand of DNA or mRNA that
encodes an oncogene, such as bFGF, int-2, hst-1/K-FGF, FGF-5, hst-2/FGF-6, FGF-8.
Other useful antisense oligonucleotides include those that are specific for IL-8 (see, e.g.,
U.S. Patent No. 5,241,049; and PCT Applications WO 89/004836; WO 90/06321; WO
89/10962; WO 90/00563; and WO 91/08483), which can be linked to bFGF for the
treatment of psoriasis, and antisense oligonucleotides that are specific for nonmuscle
myosin heavy chain and/or c-myb (see, e.g., Simons et al. (1992) Circ. Res. 70:835-843;
PCT Application WO 93/01286, U.S. Application Serial No. 07/723,454; LeClerc et al.
(1991) J. Am. Coll. Cardiol. 17 (2 Suppl A):105A; Ebbecke et al. (1992) Basic Res.
Cardiol. 87:585-591), which can be targeted by an FGF to inhibit smooth muscle cell
proliferation, such as that following angioplasty, and thereby prevent restenosis or
inhibit viral gene expression in transformed or infected cells.

A ribozyme is an RNA molecule that specifically cleaves RNA
substrates, such as mRNA, and thus inhibits or interferes with cell growth or expression.
There are at least five known classes of ribozymes that are involved in the
cleavage and/or ligation of RNA chains. Ribozymes can be targeted to any RNA
transcript and can catalytically cleave such transcript (see, e.g., U.S. Patent No.
5,272,262; U.S. Patent No. 5,144,019; and U.S. Patent Nos. 5,168,053, 5,180,818,
5,116,742 and 5,093,246 to Cech et al., which describe ribozymes and methods for
production thereof). Any such ribozyme may be linked to the growth factor for delivery
to a cell bearing a receptor for a receptor-internalized binding ligand.

The ribozymes may be delivered to the targeted cells by DNA encoding
the ribozyme linked to a eukaryotic promoter, such as a eukaryotic viral promoter, such
that upon introduction into the nucleus, the ribozyme will be directly transcribed. In
such instances, the construct will also include a nuclear translocation sequence,
generally as part of the ligand or as part of a linker between the ligand and nucleic acid
binding domain.

DNA that encodes a therapeutic product contemplated for use includes
DNA encoding correct copies of defective genes, such as the defective gene (CFTR)
associated with cystic fibrosis (see, e.g., International Application WO 93/03709, U.S.
Application Serial No. 07/745,900; and Riordan et al. (1989) Science 245:1066-1073),
and anticancer agents, such as tumor necrosis factors. The conjugate preferably
includes an NTS. If the conjugate is designed such that the ligand and nucleic acid
binding domain are cleaved in the cytoplasm, then the NTS should be included in a
portion of the conjugate or linker that remains bound to the DNA. The nuclear
translocation sequence (NTS) may be a heterologous sequence or may be derived
from the selected growth factor.

F. Construct containing cytocidal-encoding agent

In the case of cytocide molecules such as the ribosome inactivating
proteins, very few molecules may need to be expressed to effect cell killing. Indeed,
only a single molecule of diphtheria toxoid introduced into a cell was sufficient to kill
the cell. With other cytocides or prodrugs, it may be that propagation or stable
maintenance of the construct is necessary to attain a sufficient amount or concentration
of the gene product for effective gene therapy. Examples of replicating and stable
eukaryotic plasmids may be found in the scientific literature.

In general, constructs will also contain elements necessary for
transcription and translation. If the cytocide-encoding agent is DNA, then it must
contain a promoter. The choice of the promoter will depend upon the cell type to be
transformed and the degree or type of control desired. Promoters can be constitutive or
active in any cell type, tissue specific, cell specific, event specific, temporal-specific or
inducible. Cell-type specific promoters and event type specific promoters are preferred.
Examples of constitutive or nonspecific promoters include the SV40 early promoter
(U.S. Patent No. 5,118,627), the SV40 late promoter (U.S. Patent No. 5,118,627), CMV
early gene promoter (U.S. Patent No. 5,168,062), and adenovirus promoter. In addition
to viral promoters, cellular promoters are also amenable within the context of this
invention. In particular, cellular promoters for the so-called housekeeping genes are
useful. Viral promoters are preferred, because generally they are stronger promoters
than cellular promoters.

Tissue specific promoters are particularly useful when a certain tissue
type is to be targeted for transformation. By using one of this class of promoters, an
extra margin of specificity can be attained. For example, when the indication to be
treated is ophthalmological (e.g., secondary lens clouding), either the alpha-crystallin
promoter or gamma-crystallin promoter is preferred. When a tumor is the target of
gene delivery, cellular promoters for specific tumor markers or promoters more active
in tumor cells should be chosen. Thus, to treat prostate tumor, the prostate-specific
antigen promoter is especially useful. Similarly, the tyrosinase promoter or
tyrosinase-related protein promoter is a preferred promoter for melanoma treatment. For
treatment of diseases that are angiogenic or exacerbated by angiogenesis, the VEGF
receptor promoter is preferred. The VEGF receptor is expressed in developing
capillaries. For treatment of breast cancer, the promoter from heat shock protein 27 is
preferred; for treatment of colon or lung cancer, the promoter from carcinoembryonic
antigen is preferred; for treatment of restenosis or other diseases involving smooth
muscle cells, the promoter from α-actin or myosin heavy chain is preferred. For B
lymphocytes, the immunoglobulin variable region gene promoter; for T lymphocytes,
the TCR receptor variable region promoter; for helper T lymphocytes, the CD4
promoter; for liver, the albumin or α-fetoprotein promoter, are a few additional examples
of tissue specific promoters. Many other examples of tissue specific promoters are
readily available to one skilled in the art. Some of these promoters are temporally
regulated, such as c-myc and cyclin D.
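
The tissue-specific pairings listed above amount to a lookup from target tissue or indication to preferred promoter. The sketch below simply restates those pairings as a table for quick reference; the key strings and the helper name `promoters_for` are hypothetical labels chosen for illustration and carry no meaning beyond the paragraph above.

```python
# Preferred tissue-specific promoters, restated from the paragraph
# above. Key names are hypothetical labels chosen for this sketch.
PREFERRED_PROMOTERS = {
    "secondary lens clouding": ["alpha-crystallin", "gamma-crystallin"],
    "prostate tumor": ["prostate-specific antigen"],
    "melanoma": ["tyrosinase", "tyrosinase-related protein"],
    "angiogenic disease": ["VEGF receptor"],
    "breast cancer": ["heat shock protein 27"],
    "colon cancer": ["carcinoembryonic antigen"],
    "lung cancer": ["carcinoembryonic antigen"],
    "restenosis": ["alpha-actin", "myosin heavy chain"],
    "b lymphocytes": ["immunoglobulin variable region gene"],
    "t lymphocytes": ["TCR variable region"],
    "helper t lymphocytes": ["CD4"],
    "liver": ["albumin", "alpha-fetoprotein"],
}


def promoters_for(target: str) -> list:
    """Return the promoters listed for `target`, or [] if none is listed."""
    return list(PREFERRED_PROMOTERS.get(target.strip().lower(), []))


if __name__ == "__main__":
    for target in ("Melanoma", "liver", "kidney"):
        print(target, "->", promoters_for(target) or "no promoter listed")
```

An empty result means only that the paragraph names no promoter for that target, not that no suitable promoter exists.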

Inducible promoters may also be used. These promoters include the
MMTV LTR (PCT WO 91/13160), which is inducible by dexamethasone;
metallothionein, which is inducible by heavy metals; and promoters with cAMP
response elements, which are inducible by cAMP. By using an inducible promoter, the
nucleic acid may be delivered to a cell and will remain quiescent until the addition of
the inducer. This allows further control on the timing of production of the therapeutic
gene.

Event-type specific promoters are active or up-regulated only upon the
occurrence of an event, such as tumorigenicity or viral infection. The HIV LTR is a
well known example of an event-specific promoter. The promoter is inactive unless the
tat gene product is present, which occurs upon viral infection. Another promoter is
c-myc.

Additionally, promoters that are coordinately regulated with a particular
cellular gene may be used. For example, promoters of genes that are coordinately
expressed when a particular FGF receptor gene is expressed may be used. Then the
nucleic acid will be transcribed when the FGF receptor, such as FGFR1, is expressed
and not when FGFR2 is expressed. This type of promoter is especially useful when one
knows the pattern of FGF receptor expression in a particular tissue, so that specific cells
within that tissue may be killed upon transcription of a cytotoxic agent gene without
affecting the surrounding tissues.

If the domain binds in a sequence specific manner, the construct must
contain the sequence that binds to the nucleic acid binding domain. As described
below, the target nucleotide sequence may be contained within the coding region of the
cytocide, in which case, no additional sequence need be incorporated. Additionally, it
may be desirable to have multiple copies of target sequence. If the target sequence is
coding sequence, the additional copies must be located in non-coding regions of the
cytocide-encoding agent. The target sequences of the nucleic acid binding domains are
generally known. If unknown, the target sequence may be readily determined.
Techniques are generally available for establishing the target sequence (e.g., see PCT
Application WO 92/05285 and U.S. Serial No. 586,769).


G. Other Elements 

1. Nuclear translocation signal 

As used herein, a "nuclear translocation or targeting sequence" (NTS) is 
a sequence of amino acids in a protein that are required for translocation of the protein 

20 into a cell nucleus. Examples of NTSs are set forth in Table 2 below. Comparison with 
known NTSs, and if necessary testing of candidate sequences, should permit those of 
skill in the art to readily identify other amino acid sequences that function as NTSs. A 
heterologous NTS refers to an NTS that is different from the NTS that occurs in the 
wild-type peptide, polypeptide, or protein. For example, the NTS may be derived from 

25 another polypeptide, it may be synthesized, or it may be derived from another region in 
the same polypeptide. 
TABLE 2

Source              Sequence*                                              SEQ ID NO.

SV40 large T        ProLysLysArgLysValGlu
Polyoma large T     ProProLysLysAlaArgGluVal
Human c-Myc         ProAlaAlaLysArgValLysLeuAsp
Adenovirus E1A      LysArgProArgPro
Yeast mat α2        LysIleProIleLys
c-Erb-A             A. GlyLysArgLysArgLysSer
                    B. SerLysArgValAlaLysArgLysLeu
                    C. SerHisTrpLysGlnLysArgLysPhe

                    ProLeuLeuLysLysIleLysGln

                    ProGlnProLysLysLysPro

                    ProGlyLysArgLysLysGluMetThrLysGlnLysGluValPro

                    GlyArgLysLysArgArgGlnArgArgArgAlaPro

FGF-1               AsnTyrLysLysProLysLeu

                    HisPheLysAspProLysArg

                    AlaProArgArgArgLysLeu

                    IleLysArgLeuArgArg

                    GlyArgArg

                    IleLysArgGlnArgArg

                    IleArgValArgArg

*Superscript indicates position in protein

In order to deliver the nucleic acid to the nucleus, the conjugate should include an
NTS. If the conjugate is designed such that the receptor-binding internalized ligand
and linked nucleic acid binding domain is cleaved or dissociated in the cytoplasm, then
the NTS should be included in a portion of the complex that remains bound to the
nucleic acid, so that, upon internalization, the conjugate will be trafficked to the
nucleus. Thus, the NTS is preferably included in the nucleic acid binding domain, but
may additionally be included in the ligand. An NTS is preferred if the
cytocide-encoding agent is DNA. If the cytocide-encoding agent is mRNA, an NTS may be
omitted. The nuclear translocation sequence (NTS) may be a heterologous sequence or may
be derived from the selected growth factor. All presently identified members of the FGF
family of peptides contain an NTS (see, e.g., International Application WO 91/15229 and
Table 2). A typical consensus NTS sequence contains an amino-terminal proline or
glycine followed by at least three basic residues in an array of seven to nine amino
acids (see, e.g., Dang et al., J. Biol. Chem. 264:18019-18023, 1989; Dang et al., Mol.
Cell. Biol. 5:4049-4058, 1988, and Table 2).
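
As a rough illustration of how the consensus just described might be used to screen a
protein for candidate NTSs, the following Python sketch reports each window of seven to
nine residues that starts with Pro or Gly and contains at least three basic residues.
Treating only Lys and Arg as basic, using one-letter codes, and the function name are
assumptions of this sketch. A match is only a candidate and would still need to be
tested as described above.

```python
BASIC = set("KR")  # Lys and Arg; treating only these as basic is an assumption


def find_candidate_nts(protein, min_basic=3, min_len=7, max_len=9):
    """Return (start, window) for the shortest window at each Pro/Gly start
    that holds at least min_basic basic residues after the first position."""
    protein = protein.upper()
    hits = []
    for start, residue in enumerate(protein):
        if residue not in "PG":
            continue
        for length in range(min_len, max_len + 1):
            window = protein[start:start + length]
            if len(window) < length:
                break  # not enough residues left for this window size
            if sum(aa in BASIC for aa in window[1:]) >= min_basic:
                hits.append((start, window))
                break
    return hits


# Two Table 2 entries written in one-letter code.
SV40_ENTRY = "PKKRKVE"        # ProLysLysArgLysValGlu
TAT_LIKE_ENTRY = "GRKKRRQRRRAP"  # GlyArgLysLysArgArgGlnArgArgArgAlaPro
```

Shorter entries of Table 2 that do not begin with Pro or Gly are not found by this
screen, which is consistent with the text calling the pattern typical and not
universal.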

2. Cytoplasm-translocation signal 

A cytoplasm-translocation signal sequence is a sequence of amino acids in a protein
that causes retention of proteins in the lumen of the endoplasmic reticulum and/or
translocates proteins to the cytosol. The signal sequence in mammalian cells is KDEL
(Lys-Asp-Glu-Leu) (SEQ ID NO. 42) (Munro and Pelham, Cell 48:899-907, 1987). Some
modifications of this sequence have been made without loss of activity. For example,
the sequences RDEL (Arg-Asp-Glu-Leu) (SEQ ID NO. 43) and KEEL (Lys-Glu-Glu-Leu) (SEQ ID
NO. 44) confer efficient or partial retention, respectively, in plants (Denecke et al.,
EMBO J. 11:2345-2355, 1992).
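
A short Python sketch of how a designed fusion sequence might be checked for one of the
three signals named above. It assumes that the signal is read at the carboxy terminus
of the protein, which is where KDEL-type signals are known to act; the function name
and the example sequences are hypothetical.

```python
# Signals named in the text, with the activity the text attributes to each.
RETENTION_SIGNALS = {
    "KDEL": "signal sequence in mammalian cells",
    "RDEL": "efficient retention in plants",
    "KEEL": "partial retention in plants",
}


def c_terminal_signal(protein):
    """Return the C-terminal tetrapeptide if it is a listed signal, else None."""
    tail = protein.strip().upper()[-4:]
    return tail if tail in RETENTION_SIGNALS else None
```

For example, `c_terminal_signal("MGGGGSKDEL")` returns `"KDEL"`, while the same four
residues placed at the amino terminus are not reported.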

A cytoplasm-translocation signal sequence may be included in either the
receptor-internalized binding ligand or the nucleic acid binding domain part or both.
If cleavable linkers are used to link the ligand with the nucleic acid binding domain,
the cytoplasm-translocation signal is preferably included in the nucleic acid binding
domain, which will stay bound to the cytocide-encoding agent. Additionally, a
cytoplasmic-translocation signal sequence may be included in the receptor-internalized
binding ligand, as long as it does not interfere with receptor binding. Similarly, the
signal sequence placed in the nucleic acid binding domain should not interfere with
binding to the cytocide-encoding agent.

3. Endosome-disruptive peptides

In addition, or alternatively, membrane-disruptive peptides may be incorporated into
the complexes. For example, adenoviruses are known to enhance disruption of endosomes.
Virus-free viral proteins, such as influenza virus hemagglutinin HA-2, also disrupt
endosomes and are useful in the present invention. Other proteins may be tested in the
assays described herein to find specific endosome disrupting agents that enhance gene
delivery. In general, these proteins and peptides are amphipathic (see Wagner et al.,
Adv. Drug. Del. Rev. 14:113-135, 1994).

Endosome-disruptive peptides, sometimes called fusogenic peptides, may be incorporated
into the complex of receptor-internalized binding ligand, nucleic acid binding domain,
and cytocide-encoding agent. Two such peptides derived from influenza virus are:
GLFEAIEGFIENGWEGMIDGGGC (SEQ. ID NO. 45) and GLFEAIEGFIENGWEGMIDGWYGC (SEQ. ID NO. 46).
Other peptides useful for disrupting endosomes may be identified by general
characteristics: they are 25-30 residues in length, contain an alternating pattern of
hydrophobic domains and acidic domains, and at low pH (e.g., pH 5) form amphipathic
α-helices. A candidate endosome-disrupting peptide is tested by incorporating it into
the complex and determining whether it increases the total number of cells expressing
the target gene. The peptides are added to a complex having excess negative charge. For
example, a DNA construct is complexed with an FGF-poly-L-lysine chemical conjugate so
that only a portion of the negative charge of the DNA is neutralized. Poly-L-lysine is
added to further bind the DNA and a fusogenic peptide is then added. Optimal ratios of
DNA, poly-L-lysine and fusogenic peptide are determined using assays, such as gene
expression and cell viability.
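
The idea of neutralizing only a portion of the negative charge of the DNA can be put
into numbers with a simple charge-ratio estimate. The Python sketch below is an
illustration only. It assumes one negative charge per nucleotide, one positive charge
per lysine residue, an approximate average nucleotide mass of 330 g/mol, and a lysine
residue mass of 128.17 g/mol with any counter-ion ignored. It counts only the
poly-L-lysine portion of the conjugate, and the function names are hypothetical. The
ratios actually used are determined empirically by the assays mentioned above.

```python
AVG_NUCLEOTIDE_MASS = 330.0   # g/mol per nucleotide, approximate (assumption)
LYSINE_RESIDUE_MASS = 128.17  # g/mol per lysine residue, counter-ion ignored (assumption)


def phosphate_nmol(dna_ug):
    """Approximate nmol of phosphate (negative charges) in dna_ug micrograms of DNA."""
    return dna_ug * 1000.0 / AVG_NUCLEOTIDE_MASS


def lysine_nmol(polylysine_ug):
    """Approximate nmol of lysine residues (positive charges) in poly-L-lysine."""
    return polylysine_ug * 1000.0 / LYSINE_RESIDUE_MASS


def charge_ratio(dna_ug, polylysine_ug):
    """Positive-to-negative charge ratio; a value below 1 leaves excess negative charge."""
    return lysine_nmol(polylysine_ug) / phosphate_nmol(dna_ug)


def polylysine_ug_for_ratio(dna_ug, ratio):
    """Micrograms of poly-L-lysine that would give the requested charge ratio."""
    return ratio * phosphate_nmol(dna_ug) * LYSINE_RESIDUE_MASS / 1000.0
```

Under these assumptions, a first addition chosen with `polylysine_ug_for_ratio(dna_ug,
0.5)` would neutralize about half of the DNA charge, leaving the remainder available
for the further poly-L-lysine and the fusogenic peptide.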

The fusogenic peptides may alternatively be incorporated into the complex as a fusion
protein with either the ligand or the nucleic acid binding domain or both. The
endosome-disruptive peptide may be present as single or multiple copies at the N- or
C-terminus of the ligand. A single fusion protein of the endosome-disruptive peptide,
nucleic acid binding domain, and receptor-internalized binding ligand may be
constructed and expressed. For insertion into a construct, DNA encoding the
endosome-disruptive peptide may be synthesized by PCR using overlapping
oligonucleotides and incorporating a restriction site at the 5' and 3' end to
facilitate cloning. The sequence may be verified by sequence analysis.

4. Linkers

As used herein, a "linker" is an extension that links the receptor-binding internalized
ligand or fragment thereof and the nucleic acid binding domain. In certain instances,
the linker is used to conjugate the ligand directly to the nucleic acid. The linkers
provided herein confer specificity, enhance intracellular availability, serum stability
and/or solubility on the conjugate and may serve to promote condensation of the nucleic
acid.

The linkers provided herein confer specificity and serum stability on the cytotoxic
conjugate, for example, by conferring specificity for certain proteases, particularly
proteases that are present in only certain subcellular compartments or that are present
at higher levels in tumor cells than normal cells. Specificity for proteases present in
intracellular compartments and absent in blood is particularly preferred. The linkers
may also include sorting signals that direct the conjugate to particular intracellular
loci or compartments. Additionally, the linkers may reduce steric hindrance between the
growth factor and other protein, or linked nucleic acid, by distancing the components
of the conjugate. Linkers may also condense the nucleic acid. For this purpose, the
linker comprises highly basic amino acids (e.g., Lys, Arg) and may even be
poly-L-lysine.

In order to increase the serum stability, solubility and/or intracellular concentration
or condense the targeted agent, one or more linkers is (are) inserted between the
receptor-binding internalized ligand and the nucleic acid binding domain. These linkers
include peptide linkers, such as intracellular protease substrates, and chemical
linkers, such as acid labile linkers, ribozyme substrate linkers and others. Peptide
linkers may be inserted using heterobifunctional reagents, described below, or,
preferably, are linked to FGF, other growth factors, including heparin-binding growth
factors, or cytokines by linking DNA encoding the ligand to the DNA encoding the
nucleic acid binding domain.

Chemical linkers may be inserted by covalently coupling the linker to the FGF, other
growth factor protein, or cytokine and the nucleic acid binding domain. The linker may
be bound via the N- or C-terminus or an internal residue. The heterobifunctional
agents, described below, may be used to effect such covalent coupling.

a. Protease substrates

Peptides encoding protease-specific substrates may be introduced between the ligand and
the nucleic acid binding domain. The peptides may be inserted using heterobifunctional
reagents, as described below, or preferably inserted by recombinant means and
expression of the resulting chimera.

Any protease specific substrate (see, e.g., O'Hare et al., FEBS 273:200-204, 1990;
Forsberg et al., J. Protein Chem. 10:517-526, 1991; Westby et al., Bioconjugate Chem.
3:375-381, 1992) may be introduced as a linker as long as the substrate is cleaved in
an intracellular compartment. Preferred substrates include those that are specific for
proteases that are expressed at higher levels in tumor cells, that are preferentially
expressed in the endosome, or that are absent in blood. The following substrates are
among those contemplated for use in accord with the methods herein: cathepsin B
substrate, cathepsin D substrate, trypsin substrate, thrombin substrate, and
recombinant subtilisin substrate.

b. Flexible linkers and linkers that increase the solubility of the conjugates

Flexible linkers, which reduce steric hindrance, and linkers that increase solubility
of the conjugates are contemplated for use, either alone or with other linkers, such as
the protease specific substrate linkers. Typically, these linkers are simple polymers
of small amino acids (i.e., small side groups) with uncharged polar side groups. These
amino acids (Gly, Ser, Thr, Cys, Tyr, Asn, Gln) are more soluble in water. Of these
amino acids, Gly and Ser are preferred. Such linkers include, but are not limited to,
(Gly4Ser)n, (Ser4Gly)n and (AlaAlaProAla)n in which n is 1 to 6, preferably 1-4, such
as:

a. Gly4Ser SEQ ID NO: 47
CCATGGGCGG CGGCGGCTCT GCCATGG

b. (Gly4Ser)2 SEQ ID NO: 48
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG

c. (Ser4Gly)4 SEQ ID NO: 49
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG GGCTCGTCGT
CGTCGGGCGC CATGG

d. (Ser4Gly)2 SEQ ID NO: 50
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG

e. (AlaAlaProAla)n, where n is 1 to 4, preferably 2 (see
SEQ ID NO: 51)
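
The linker cassettes listed above can be checked by translating them. The Python sketch
below uses the standard genetic code and reads each cassette in the frame of the ATG
contained in its leading CCATGG, which is an assumption about the intended reading
frame; both ends of each cassette read CCATGG, the recognition sequence of NcoI. The
variable and function names are hypothetical.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order for each codon position.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=0):
    """Translate DNA (spaces ignored) from the given offset; trailing bases are dropped."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(frame, len(dna) - 2, 3))


SEQ_47 = "CCATGGGCGG CGGCGGCTCT GCCATGG"
SEQ_48 = "CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG"
SEQ_50 = "CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG"

# Frame 2 starts at the ATG inside the leading CCATGG.
LINKER_47 = translate(SEQ_47, frame=2)
LINKER_48 = translate(SEQ_48, frame=2)
LINKER_50 = translate(SEQ_50, frame=2)
```

Read this way, each cassette encodes the named repeat flanked by the residues
contributed by the two CCATGG sites.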

c. Heterobifunctional cross-linking reagents

Numerous heterobifunctional cross-linking reagents that are used to form covalent bonds
between amino groups and thiol groups and to introduce thiol groups into proteins are
known to those of skill in this art (see, e.g., the PIERCE CATALOG, ImmunoTechnology
Catalog & Handbook, 1992-1993, which describes the preparation of and use of such
reagents and provides a commercial source for such reagents; see also, e.g., Cumber et
al., Bioconjugate Chem. 3:397-401, 1992; Thorpe et al., Cancer Res. 47:5924-5931, 1987;
Gordon et al., Proc. Natl. Acad. Sci. 84:308-312, 1987; Walden et al., J. Mol. Cell
Immunol. 2:191-197, 1986; Carlsson et al., Biochem. J. 173:723-737, 1978; Mahan et al.,
Anal. Biochem. 162:163-170, 1987; Wawrzynczak et al., Br. J. Cancer 55:361-366, 1992;
Fattom et al., Infection & Immun. 60:584-589, 1992). These reagents may be used to form
covalent bonds between the receptor-binding internalized ligands with protease
substrate peptide linkers and nucleic acid binding domain. These reagents include, but
are not limited to: N-succinimidyl-3-(2-pyridyldithio)propionate (SPDP; disulfide
linker); sulfosuccinimidyl 6-[3-(2-pyridyldithio)propionamido]hexanoate
(sulfo-LC-SPDP); succinimidyloxycarbonyl-α-methyl benzyl thiosulfate (SMBT, hindered
disulfate linker); succinimidyl 6-[3-(2-pyridyldithio)propionamido]hexanoate (LC-SPDP);
sulfosuccinimidyl 4-(N-maleimidomethyl)cyclohexane-1-carboxylate (sulfo-SMCC);
succinimidyl 3-(2-pyridyldithio)butyrate (SPDB; hindered disulfide bond linker);
sulfosuccinimidyl 2-(7-azido-4-methylcoumarin-3-acetamide)ethyl-1,3'-dithiopropionate
(SAED); sulfosuccinimidyl 7-azido-4-methylcoumarin-3-acetate (SAMCA);
sulfosuccinimidyl 6-[alpha-methyl-alpha-(2-pyridyldithio)toluamido]hexanoate
(sulfo-LC-SMPT); 1,4-di-[3'-(2'-pyridyldithio)propionamido]butane (DPDPB);
4-succinimidyloxycarbonyl-α-methyl-α-(2-pyridylthio)toluene (SMPT, hindered disulfate
linker); sulfosuccinimidyl 6-[α-methyl-α-(2-pyridyldithio)toluamido]hexanoate
(sulfo-LC-SMPT); m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS);
m-maleimidobenzoyl-N-hydroxysulfosuccinimide ester (sulfo-MBS);
N-succinimidyl(4-iodoacetyl)aminobenzoate (SIAB; thioether linker);
sulfosuccinimidyl(4-iodoacetyl)aminobenzoate (sulfo-SIAB); succinimidyl
4-(p-maleimidophenyl)butyrate (SMPB); sulfosuccinimidyl 4-(p-maleimidophenyl)butyrate
(sulfo-SMPB); azidobenzoyl hydrazide (ABH).

These linkers should be particularly useful when used in combination with peptide
linkers, such as those that increase flexibility.



d. Acid cleavable, photocleavable, and heat sensitive linkers

Acid cleavable linkers include, but are not limited to, bismaleimideothoxy propane,
adipic acid dihydrazide linkers (see, e.g., Fattom et al., Infection & Immun.
60:584-589, 1992) and acid labile transferrin conjugates that contain a sufficient
portion of transferrin to permit entry into the intracellular transferrin cycling
pathway (see, e.g., Welhoner et al., J. Biol. Chem. 266:4309-4314, 1991). Conjugates
linked via acid cleavable linkers should be preferentially cleaved in acidic
intracellular compartments, such as the endosome.

Photocleavable linkers are linkers that are cleaved upon exposure to light (see, e.g.,
Goldmacher et al., Bioconj. Chem. 3:104-107, 1992), thereby releasing the targeted
agent upon exposure to light. (Hazum et al., Proc. Eur. Pept. Symp., 16th, Brunfeldt, K
(Ed), pp. 105-110, 1981; nitrobenzyl group as a photocleavable protective group for
cysteine; Yen et al., Makromol. Chem. 190:69-82, 1989; water soluble photocleavable
copolymers, including hydroxypropylmethacrylamide copolymer, glycine copolymer,
fluorescein copolymer and methylrhodamine copolymer; and Senter et al., Photochem.
Photobiol. 42:231-237, 1985; nitrobenzyloxycarbonyl chloride cross linking reagents
that produce photocleavable linkages). Such linkers are particularly useful in treating
dermatological or ophthalmic conditions and other tissues, such as blood vessels during
angioplasty in the prevention or treatment of restenosis, that can be exposed to light
using fiber optics. After administration of the conjugate, the eye or skin or other
body part can be exposed to light, resulting in release of the targeted moiety from the
conjugate. This should permit administration of higher dosages of such conjugates
compared to conjugates that release a cytotoxic agent upon internalization. Heat
sensitive linkers would also have similar applicability.

H. Expression vectors and host cells for expression of receptor-binding internalized 
ligands and nucleic acid binding domains 
Host organisms include those organisms in which recombinant production of heterologous
proteins has been carried out, such as bacteria (for example, E. coli), yeast (for
example, Saccharomyces cerevisiae and Pichia pastoris), mammalian cells, and insect
cells. Presently preferred host organisms are E. coli bacterial strains.

The DNA construct encoding the desired protein is introduced into a plasmid for
expression in an appropriate host. In preferred embodiments, the host is a bacterial
host. The sequence encoding the ligand or nucleic acid binding domain is preferably
codon-optimized for expression in the particular host. Thus, for example, if human
FGF-2 is expressed in bacteria, the codons would be optimized for bacterial usage. For
small coding regions the gene can be synthesized as a single oligonucleotide. For
larger proteins, splicing of multiple oligonucleotides, mutagenesis, or other
techniques known to those in the art may be used. For example, the sequence of a
bacterial-codon preferred FGF-SAP fusion is shown in SEQ. ID NO. 80. The sequences of
nucleotides in the plasmids that are regulatory regions, such as promoters and
operators, are operationally associated with one another for transcription. The
sequence of nucleotides encoding the growth factor or growth factor-chimera may also
include DNA encoding a secretion signal, whereby the resulting peptide is a precursor
protein. The resulting processed protein may be recovered from the periplasmic space or
the fermentation medium.
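
Codon optimization for a bacterial host can be sketched as a back-translation that
picks one codon per amino acid. The Python example below is a minimal illustration: the
one-codon-per-residue table is an assumed set of codons commonly regarded as frequent
in E. coli, not a table taken from this specification, and practical designs also weigh
factors such as repeats and restriction sites. The function names are hypothetical. The
example peptide is the HisPheLysAspProLysArg entry of Table 2 in one-letter code.

```python
# Assumed E. coli-preferred codon for each amino acid (illustrative only).
ECOLI_PREFERRED = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",
}

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, used here only to confirm the round trip.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def back_translate(protein):
    """Return a coding sequence that uses the assumed preferred codon for each residue."""
    return "".join(ECOLI_PREFERRED[aa] for aa in protein.upper())


def translate(dna):
    """Translate a coding sequence in frame 0 with the standard genetic code."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


PEPTIDE = "HFKDPKR"
CODING = back_translate(PEPTIDE)
```

Translating the designed sequence back to protein is a simple check that the
optimization changed only the codons and not the encoded amino acids.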

In preferred embodiments, the DNA plasmids also include a transcription terminator
sequence. As used herein, a "transcription terminator region" has either (a) a
subsegment that encodes a polyadenylation signal and polyadenylation site in the
transcript, and/or (b) a subsegment that provides a transcription termination signal
that terminates transcription by the polymerase that recognizes the selected promoter.
The entire transcription terminator may be obtained from a protein-encoding gene, which
may be the same or different from the inserted gene or the source of the promoter.
Transcription terminators are optional components of the expression systems herein, but
are employed in preferred embodiments.

The plasmids used herein include a promoter in operable association with the DNA
encoding the protein or polypeptide of interest and are designed for expression of
proteins in a bacterial host. It has been found that tightly regulatable promoters are
preferred for expression of saporin. Suitable promoters for expression of proteins and
polypeptides herein are widely available and are well known in the art. Inducible
promoters or constitutive promoters that are linked to regulatory regions are
preferred. Such promoters include, but are not limited to, the T7 phage promoter and
other T7-like phage promoters, such as the T3, T5 and SP6 promoters, the trp, lpp, and
lac promoters, such as the lacUV5, from E. coli; the P10 or polyhedrin gene promoter of
baculovirus/insect cell expression systems (see, e.g., U.S. Patent Nos. 5,243,041,
5,242,687, 5,266,317, 4,745,051, and 5,169,784) and inducible promoters from other
eukaryotic expression systems. For expression of the proteins such promoters are
inserted in a plasmid in operative linkage with a control region such as the lac
operon.

Preferred promoter regions are those that are inducible and functional in E. coli.
Examples of suitable inducible promoters and promoter regions include, but are not
limited to: the E. coli lac operator responsive to isopropyl
β-D-thiogalactopyranoside (IPTG; see, e.g., Nakamura et al., Cell 18:1109-1117, 1979);
the metallothionein promoter metal-regulatory-elements responsive to heavy-metal (e.g.,
zinc) induction (see, e.g., U.S. Patent No. 4,870,009 to Evans et al.); the phage T7lac
promoter responsive to IPTG (see, e.g., U.S. Patent No. 4,952,496; and Studier et al.,
Meth. Enzymol. 185:60-89, 1990) and the TAC promoter.

The plasmids also preferably include a selectable marker gene or genes that are
functional in the host. A selectable marker gene includes any gene that confers a
phenotype on bacteria that allows transformed bacterial cells to be identified and
selectively grown from among a vast majority of untransformed cells. Suitable
selectable marker genes for bacterial hosts, for example, include the ampicillin
resistance gene (Ampʳ), tetracycline resistance gene (Tcʳ) and the kanamycin resistance
gene (Kanʳ). The kanamycin resistance gene is presently preferred.

The plasmids may also include DNA encoding a signal for secretion of the operably
linked protein. Secretion signals suitable for use are widely available and are well
known in the art. Prokaryotic and eukaryotic secretion signals functional in E. coli
may be employed. The presently preferred secretion signals include, but are not limited
to, those encoded by the following E. coli genes: ompA, ompT, ompF, ompC,
beta-lactamase, and alkaline phosphatase, and the like (von Heijne, J. Mol. Biol.
184:99-105, 1985). In addition, the bacterial pelB gene secretion signal (Lei et al.,
J. Bacteriol. 169:4379, 1987), the phoA secretion signal, and the cek2 functional in
insect cell may be employed. The most preferred secretion signal is the E. coli ompA
secretion signal. Other prokaryotic and eukaryotic secretion signals known to those of
skill in the art may also be employed (see, e.g., von Heijne, J. Mol. Biol. 184:99-105,
1985). Using the methods described herein, one of skill in the art can substitute
secretion signals that are functional in either yeast, insect or mammalian cells to
secrete proteins from those cells.

Particularly preferred plasmids for transformation of E. coli cells include the pET
expression vectors (see U.S. Patent 4,952,496; available from Novagen, Madison, WI; see
also literature published by Novagen describing the system). Such plasmids include pET
11a, which contains the T7lac promoter, T7 terminator, the inducible E. coli lac
operator, and the lac repressor gene; pET 12a-c, which contains the T7 promoter, T7
terminator, and the E. coli ompT secretion signal; and pET 15b (Novagen, Madison, WI),
which contains a His-Tag™ leader sequence for use in purification with a His column and
a thrombin cleavage site that permits cleavage following purification over the column,
the T7-lac promoter region and the T7 terminator.

Other preferred plasmids include the pKK plasmids, particularly pKK 223-3, which
contains the tac promoter (available from Pharmacia; see also Brosius et al., Proc.
Natl. Acad. Sci. 81:6929, 1984; Ausubel et al., Current Protocols in Molecular Biology;
U.S. Patent Nos. 5,122,463, 5,173,403, 5,187,153, 5,204,254, 5,212,058, 5,212,286,
5,215,907, 5,220,013, 5,223,483, and 5,229,279). Plasmid pKK has been modified by
replacement of the ampicillin resistance marker gene, by digestion with EcoRI, with a
kanamycin resistance cassette with EcoRI sticky ends (purchased from Pharmacia;
obtained from pUC4K, see, e.g., Vieira et al., Gene 19:259-268, 1982; and U.S. Patent
No. 4,719,179). Baculovirus vectors, such as pBlueBac (also called pJVETL and
derivatives thereof), particularly pBlueBacIII (see, e.g., U.S. Patent Nos. 5,278,050,
5,244,805, 5,243,041, 5,242,687, 5,266,317, 4,745,051, and 5,169,784; available from
Invitrogen, San Diego), may also be used for expression of the polypeptides in insect
cells. The pBlueBacIII vector is a dual promoter vector and provides for the selection
of recombinants by blue/white screening as this plasmid contains the β-galactosidase
gene (lacZ) under the control of the insect recognizable ETL promoter and is inducible
with IPTG. A DNA construct may be made in baculovirus vector pBlueBacIII and then
co-transfected with wild type virus into insect cells Spodoptera frugiperda (sf9 cells;
see, e.g., Luckow et al., Bio/technology 6:47-55, 1988, and U.S. Patent No. 4,745,051).

Other plasmids include the pIN-IIIompA plasmids (see U.S. Patent No. 4,575,013; see
also Duffaud et al., Meth. Enz. 153:492-507, 1987), such as pIN-IIIompA2. The
pIN-IIIompA plasmids include an insertion site for heterologous DNA linked in
transcriptional reading frame with four functional fragments derived from the
lipoprotein gene of E. coli. The plasmids also include a DNA fragment coding for the
signal peptide of the ompA protein of E. coli, positioned such that the desired
polypeptide is expressed with the ompA signal peptide at its amino terminus, thereby
allowing efficient secretion across the cytoplasmic membrane. The plasmids further
include DNA encoding a specific segment of the E. coli lac promoter-operator, which is
positioned in the proper orientation for transcriptional expression of the desired
polypeptide, as well as a separate functional E. coli lacI gene encoding the associated
repressor molecule that, in the absence of lac operon inducer, interacts with the lac
promoter-operator to prevent transcription therefrom. Expression of the desired
polypeptide is under the control of the lipoprotein (lpp) promoter and the lac
promoter-operator, although transcription from either promoter is normally blocked by
the repressor molecule. The repressor is selectively inactivated by means of an inducer
molecule, thereby inducing transcriptional expression of the desired polypeptide from
both promoters.

Preferably, the DNA fragment is replicated in bacterial cells, preferably in E. coli.
The preferred DNA fragment also includes a bacterial origin of replication, to ensure
the maintenance of the DNA fragment from generation to generation of the bacteria. In
this way, large quantities of the DNA fragment can be produced by replication in
bacteria. Preferred bacterial origins of replication include, but are not limited to,
the f1-ori and colE1 origins of replication. Preferred hosts contain chromosomal copies
of DNA encoding T7 RNA polymerase operably linked to an inducible promoter, such as the
lacUV promoter (see U.S. Patent No. 4,952,496). Such hosts include, but are not limited
to, lysogens E. coli strains HMS174(DE3)pLysS, BL21(DE3)pLysS, HMS174(DE3) and
BL21(DE3). Strain BL21(DE3) is preferred. The pLys strains provide low levels of T7
lysozyme, a natural inhibitor of T7 RNA polymerase.

The DNA fragments provided may also contain a gene coding for a repressor protein. The
repressor protein is capable of repressing the transcription of a promoter that
contains sequences of nucleotides to which the repressor protein binds. The promoter
can be derepressed by altering the physiological conditions of the cell. For example,
the alteration can be accomplished by adding to the growth medium a molecule that
inhibits the ability of the repressor to interact with the operator or with regulatory
proteins or other regions of the DNA, or by altering the temperature of the growth
media. Preferred repressor proteins include, but are not limited to, the E. coli lacI
repressor responsive to IPTG induction, the temperature sensitive λ cI857 repressor,
and the like. The E. coli lacI repressor is preferred.

DNA encoding full-length FGF-2 or FGF-2 mutein is linked to DNA encoding a nucleic acid
binding domain, such as protamine, and introduced into the pET vectors, including
pET-11a and pET-12a expression vectors (Novagen, Madison, WI), for intracellular and
periplasmic expression, respectively, of FGF-protamine fusion proteins.



Preparation of complexes containing receptor-binding internalized
ligands/nucleic acid binding domain conjugates and cytocide-encoding agents

Within the context of this invention, specificity of delivery is achieved through the
ligand. Typically, a nucleic acid binding domain is coupled to a receptor-binding
internalized ligand, either by chemical conjugation or as a fusion protein. As
described below, the ligand may alternatively be coupled directly to the nucleic acid
and then complexed with a nucleic acid binding protein, such as poly-lysine, which
serves to condense the nucleic acid. Linkers as described above may optionally be used.
The receptor-binding internalized ligand confers specificity of delivery in a
cell-specific manner. The choice of the receptor-binding internalized ligand to use
will depend upon the receptor expressed by the target cells. The receptor type of the
target cell population may be determined by conventional techniques such as antibody
staining, PCR of cDNA using receptor-specific primers, and biochemical or functional
receptor binding assays. It is preferable that the receptor be cell type-specific or
have increased expression or activity (i.e., higher rate of internalization) within the
target cell population.

As described herein, the nucleic acid binding domain can be of two types, non-specific
in its ability to bind nucleic acid, or highly specific so that the amino acid residues
bind only the desired nucleic acid sequence. Nonspecific binding proteins,
polypeptides, or compounds are generally polycationic or highly basic. Lys and Arg are
the most basic of the 20 common amino acids; proteins enriched for these residues are
candidates for nucleic acid binding domains. Examples of basic proteins include
histones, protamines, and repeating units of lysine and arginine. Poly-L-lysine is an
often-used nucleic acid binding domain (see U.S. Patent Nos. 5,166,320 and 5,354,844).
Poly-L-lysine and protamine are preferred. Other polycations, such as spermine and
spermidine, may also be used to bind nucleic acids. By way of example, the
sequence-specific proteins, including gal4, Sp-1, AP-1, myoD and the rev gene product
from HIV, may be used. Specific nucleic acid binding domains can be cloned in tandem,
individually, or multiply to a desired region of the receptor-binding internalized
ligand of interest. Alternatively, the ligand and binding domain can be chemically
conjugated to each other.

The corresponding sequence that binds a sequence-specific domain is incorporated into
the construct to be delivered. Complexing the cytocidal-encoding agent to the
receptor-binding internalized ligand/nucleic acid binding domain allows specific
binding to the nucleic acid binding domain. Even greater specificity of binding may be
achieved by identifying and using the minimal amino acid sequence that binds to the
cytocidal-encoding agent of interest. For example, phage display methods can be used to
identify amino acid residues of varying length that will bind to specific nucleic acid
sequences with high affinity. (See U.S. Patent No. 5,223,409.) The peptide sequence can
then be cloned into the receptor-binding internalized ligand as a single copy or
multiple copies. Alternatively, the peptide may be chemically conjugated to the
receptor-binding internalized ligand. Incubation of the cytocide-encoding agent with
the conjugated proteins will result in a specific binding between the two.

These complexes may be used to deliver nucleic acids that encode saporin, other
cytocidal proteins, or prodrugs into cells with appropriate receptors that are
expressed, over-expressed or more active in internalization upon binding. The cytocide
gene is cloned downstream of a mammalian promoter such as c-myc, SV40 early or late
gene, CMV-IE, TK or adenovirus promoter. As described above, promoters of interest may
be active in any cell type, active only in a tissue-specific manner, such as
α-crystalline or tyrosinase, event specific, or inducible, such as the MMTV LTR.

1. Chemical conjugation

a. Preparation of receptor-binding internalized ligands

Receptor-binding internalized ligands are prepared as discussed by any 
suitable method, including recombinant DNA technology, isolation from a suitable 
source, purchase from a commercial source, or chemical synthesis. The selected linker 
or linkers is (are) linked to the receptor-binding internalized ligands by chemical 
reaction, generally relying on an available thiol or amine group on the receptor-binding 
internalized ligands. Heterobifunctional linkers are particularly suited for chemical 
conjugation. Alternatively, if the linker is a peptide linker, then the receptor-binding 
internalized ligands, linker and nucleic acid binding domain can be expressed 
recombinantly as a fusion protein.

Any protein that binds and internalizes through a receptor interaction 
may be used herein. In particular, any member of the FGF family of peptides or portion 
thereof that binds to an FGF receptor and internalizes a linked agent may be used 
herein. For the chemical conjugation methods the protein may be produced 
recombinantly, produced synthetically or obtained from commercial or other sources. 
For the preparation of fusion proteins, the DNA encoding the FGF may be obtained
from any known source or synthesized according to its DNA or amino acid sequences
(see discussion above). 

Although any of the growth factors may be conjugated in this manner, 
FGF, VEGF, and HBEGF conjugation are discussed merely by way of example and not 
by way of limitation. 

If necessary or desired, the heterogeneity of preparations of ligand (e.g.,
FGF) containing chemical conjugates and fusion proteins can be reduced by modifying
the ligand by deleting or replacing a site(s) that causes the heterogeneity. Such sites in
FGF are typically cysteine residues that upon folding of the protein remain available for
interaction with other cysteines or for interaction with more than one cytotoxic
molecule per molecule of FGF peptide. Thus, such cysteine residues do not include any
cysteine residue that is required for proper folding of the FGF peptide or for binding to
an FGF receptor and internalization. For chemical conjugation, one cysteine residue
that in physiological conditions is available for interaction is not replaced but is used as
the site for linking the cytotoxic moiety. The resulting modified FGF is thus conjugated
with a single species of nucleic acid binding domain (or nucleic acid).

The polypeptide reactive with an FGF receptor may be modified by
removing one or more reactive cysteines that are not required for receptor binding, but
that are available for reaction with appropriately derivatized cytotoxic agent, so that the
resulting FGF protein has only one cysteine residue available for conjugation with the
cytotoxic agent. If necessary, the contribution of each cysteine to the ability to bind to
FGF receptors may be determined empirically. Each cysteine residue may be
systematically replaced with a conservative amino acid change (see Table 1, above) or
deleted. The resulting mutein is tested for the requisite biological activity, the ability to
bind to FGF receptors and internalize linked cytotoxic moieties. If the mutein retains at
least 50% of wild-type activity, then the cysteine residue is not required. Additional
cysteines are systematically deleted and replaced and the resulting muteins are tested for
activity. In this manner the minimum number and identity of the cysteines needed to
retain the ability to bind to an FGF receptor and internalize may be determined. The
resulting mutant FGF is then tested for retention of the ability to target a cytotoxic agent
to a cell that expresses an FGF receptor and to internalize the cytotoxic agent into such
cells. Retention of proliferative activity is indicative, though not definitive, of the
retention of such activities. Proliferative activity may be measured by any suitable
proliferation assay, such as the assay, exemplified below, that measures the increase in
cell number of bovine aortic endothelial cells.
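
The decision rule described above (a cysteine is treated as not required when
the corresponding mutein retains at least 50% of wild-type activity) can be
sketched in a few lines of Python. This is only an illustrative sketch: the
function name, the data layout and the activity values are hypothetical and are
not measurements from this document.

```python
def classify_cysteines(wild_type_activity, mutein_activity, threshold=0.5):
    """Split cysteine positions into (dispensable, required).

    mutein_activity maps a cysteine position to the activity measured for the
    mutein in which that cysteine was replaced or deleted. A position is
    called dispensable when the mutein keeps at least `threshold` (a fraction)
    of the wild-type activity.
    """
    dispensable, required = [], []
    for position, activity in sorted(mutein_activity.items()):
        if activity >= threshold * wild_type_activity:
            dispensable.append(position)
        else:
            required.append(position)
    return dispensable, required


# Hypothetical assay readings in arbitrary units (illustration only).
example_readings = {34: 20.0, 78: 95.0, 96: 98.0, 101: 10.0}
dispensable, required = classify_cysteines(100.0, example_readings)
print("dispensable:", dispensable)
print("required:", required)
```

The same function can be reapplied to double or triple muteins by keying the
dictionary on tuples of positions instead of single positions.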

It is noted, however, that modified or mutant FGFs may exhibit reduced
or no proliferative activity, but may be suitable for use herein, if they retain the ability
to target cytocide-encoding agent to cells bearing FGF receptors and result in
internalization. Certain residues of FGF-2 have been associated with proliferative
activity. Modification of these residues arg 116, lys 119, tyr 120, trp 123 to ile 116, glu
119, ala 120, ala 123 may be made individually (see SEQ ID NOs. 81-84) to remove
this function. The resulting protein is tested for proliferative activity by a standard
assay.
Any of FGF-1 - FGF-9 may be used. The complete amino acid sequence
of each of FGF-1 - FGF-9 is known (see, e.g., SEQ ID NO. 10 (FGF-1) and SEQ ID
NOs. 12-18 (FGF-3 - FGF-9, respectively)). Comparison among the amino acid
sequences of FGF-1 - FGF-9 reveals that one Cys is conserved among the FGF family of
peptides (see Table 3). These cysteine residues may be required for secondary structure
and are not preferred residues to be altered. Each of the remaining cysteine residues
may be systematically deleted and/or replaced by a serine residue or other residue that
would not be expected to alter the structure of the protein. The resulting peptide is
tested for biological activity. If the cysteine residue is necessary for retention of
biological activity it is not deleted; if it is not necessary, then it is preferably replaced with
a serine or other residue that should not alter the secondary structure of the resulting
protein.

The cysteine residues from each of FGF-1 - FGF-9 that appear to be 
essential for retention of biological activity and that are not preferred residues for 
deletion or replacement are as follows: 



TABLE 3

FGF-1    cys''
FGF-2    cys""
FGF-3    cys'"
FGF-4    cys'"
FGF-5    cys""
FGF-6    cys""
FGF-7    cys'^'
FGF-8    cys'"
FGF-9    cys'^



For example, FGF-1 has cysteines at positions 31, 98 and 132; FGF-2
has cysteines at positions 34, 78, 96 and 101; FGF-3 has cysteines at positions 50 and
115; FGF-4 has cysteines at positions 88 and 155; FGF-5 has cysteines at positions 19,
93, 160 and 202; FGF-6 has cysteines at positions 80 and 147; FGF-7 has cysteines at
positions 18, 23, 32, 46, 71, 133 and 137; FGF-8 has cysteines at positions 10, 19, 109
and 127; and FGF-9 has cysteines at positions 68 and 134.

Since FGF-3, FGF-4 and FGF-6 have only two cysteines, for purposes of
chemical conjugation, preferably neither cysteine is deleted or replaced, unless another
residue, preferably one near either terminus, is replaced with a cysteine. With respect to
the other FGF family members, at least one cysteine must remain available for
conjugation with the cytotoxic conjugate and probably two cysteines, but at least the
cysteine residues set forth in Table 3. A second cysteine may be required to form a
disulfide bond. Thus, any FGF peptide that has more than three cysteines is
modified for chemical conjugation by deleting or replacing the other cysteine residues.
FGF peptides that have three cysteine residues are modified by elimination of one
cysteine, conjugated to a cytotoxic moiety and tested for the ability to bind to FGF
receptors and internalize the cytotoxic moiety.

In accord with the methods herein, several muteins of basic FGF for
chemical conjugation have been produced (preparation of muteins for recombinant
expression of the conjugate is described below). DNA, obtained from pFC80 (see PCT
Application Serial No. PCT/US93/05702; United States Application Serial
No. 07/901,718; see also SEQ ID NO. 52) encoding basic FGF has been mutagenized.
Mutagenesis of cysteine 78 of basic FGF (FGF-2) to serine ([C78S]FGF) or cysteine 96
to serine ([C96S]FGF) produced two mutants that retain virtually complete proliferative
activity of native basic FGF as judged by the ability to stimulate endothelial cell
proliferation in culture. The activities of the two mutants and the native protein do not
significantly differ as assessed by efficacy or maximal response. Sequence analysis of
the modified DNA verified that each of the mutants has one codon for cysteine
converted to that for serine. The construction and biological activity of FGF-1 with
cysteine substitutions of one, two or all three cysteines has been disclosed (U.S. Patent
No. 5,223,483). The mitogenic activity of the mutants was similar to or increased over
the native protein. Thus, any of the cysteines may be mutated and FGF-1 will still bind
and internalize.
The resulting mutein FGF or unmodified FGF is reacted with a nucleic
acid binding domain. The bFGF muteins may react with a single species of derivatized
nucleic acid binding domain (mono-derivatized nucleic acid binding domain), thereby
resulting in monogenous preparations of FGF-nucleic acid binding domain conjugates
and homogeneous compositions of FGF-nucleic acid binding domain chemical
conjugates. The resulting chemical conjugates do not aggregate and retain the requisite
biological activities.

VEGF or HBEGF may be isolated from a suitable source or may be
produced using recombinant DNA methodology, discussed below. To effect chemical
conjugation herein, the growth factor protein is conjugated generally via a reactive
amine group or thiol group to the nucleic acid binding domain directly or through a
linker to the nucleic acid binding domain. The growth factor protein is conjugated
either via its N-terminus, C-terminus, or elsewhere in the polypeptide. In preferred
embodiments, the growth factor protein is conjugated via a reactive cysteine residue to
the linker or to the nucleic acid binding domain. The growth factor can also be
modified by addition of a cysteine residue, either by replacing a residue or by inserting
the cysteine, at or near the amino or carboxyl terminus, within about 20, preferably 10
residues from either end, and preferably at or near the amino terminus.

In certain embodiments, the heterogeneity of preparations may be
reduced by mutagenizing the growth factor protein to replace reactive cysteines,
leaving, preferably, only one available cysteine for reaction. The growth factor protein
is modified by deleting or replacing a site(s) on the growth factor that causes the
heterogeneity. Such sites are typically cysteine residues that, upon folding of the
protein, remain available for interaction with other cysteines or for interaction with
more than one cytotoxic molecule per molecule of heparin-binding growth factor
peptide. Thus, such cysteine residues do not include any cysteine residue that is
required for proper folding of the growth factor or for retention of the ability to bind to
a growth factor receptor and internalize. For chemical conjugation, one cysteine residue
that, in physiological conditions, is available for interaction, is not replaced because it is
used as the site for linking the cytotoxic moiety. The resulting modified heparin-binding
growth factor is conjugated with a single species of cytotoxic conjugate.

Alternatively, the contribution of each cysteine to the ability to bind to
VEGF, HBEGF or other heparin-binding growth factor receptors may be determined
empirically as described herein. Each cysteine residue may be systematically replaced
with a conservative amino acid change (see Table 1, above) or deleted. The resulting
mutein is tested for the requisite biological activity: the ability to bind to growth factor
receptors and internalize linked nucleic acid binding domain and agents. If the mutein
retains this activity, then the cysteine residue is not required. Additional cysteines are
systematically deleted and replaced and the resulting muteins are tested for activity.
Each of the remaining cysteine residues may be systematically deleted and/or replaced
by a serine residue or other residue that would not be expected to alter the structure of
the protein. The resulting peptide is tested for biological activity. If the cysteine residue
is necessary for retention of biological activity it is not deleted; if it is not necessary, then
it is preferably replaced with a serine or other residue that should not alter the secondary
structure of the resulting protein. In this manner the minimum number and identity of
the cysteines needed to retain the ability to bind to a heparin-binding growth factor
receptor and internalize may be determined. It is noted, however, that modified or
mutant heparin-binding growth factors may exhibit reduced or no proliferative activity,
but may be suitable for use herein, if they retain the ability to target a linked cytotoxic
agent to cells bearing receptors to which the unmodified heparin-binding growth factor
binds and result in internalization of the cytotoxic moiety. In the case of VEGF,
VEGF121 contains 9 cysteines and each of VEGF165, VEGF189 and VEGF206 contains 7
additional residues in the region not present in VEGF121. Any of the 7 are likely to be
non-essential for targeting and internalization of linked cytotoxic agents. Recently, the
role of Cys-25, Cys-56, Cys-67, Cys-101, and Cys-145 in dimerization and biological
activity was assessed (Claffey et al., Biochim. Biophys. Acta 1246:1-9, 1995).
Dimerization requires Cys-25, Cys-56, and Cys-67. Substitution of any one of these
cysteine residues resulted in secretion of a monomeric VEGF, which was inactive in
both vascular permeability and endothelial cell mitotic assays. In contrast, substitution
of Cys-145 had no effect on dimerization, although biological activities were somewhat
reduced. Substitution of Cys-101 did not result in the production of a secreted or
cytoplasmic protein. Thus, substitution of Cys-145 is preferred.

The VEGF monomers are preferably linked via non-essential cysteine
residues to the linkers or to the targeted agent. VEGF that has been modified by
introduction of a Cys residue at or near one terminus, preferably the N-terminus, is
preferred for use in chemical conjugation. For use herein, preferably the VEGF is
dimerized prior to linkage to the linker and/or targeted agent. Methods for coupling
proteins to the linkers, such as the heterobifunctional agents, or to nucleic acids, or to
proteins are known to those of skill in the art and are also described herein.

For recombinant expression using the methods described herein, up to all
cysteines in the HBEGF polypeptide that are not required for biological activity can be
deleted or replaced. Alternatively, for use in the chemical conjugation methods herein,
all except one of these cysteines, which will be used for chemical conjugation to the
cytotoxic agent, can be deleted or replaced. Each of the HBEGF polypeptides described
herein has six cysteine residues. Each of the six cysteines may independently be
replaced and the resulting mutein tested for the ability to bind to HBEGF receptors and
to be internalized. Alternatively, the resulting mutein-encoding DNA is used as part of
a construct containing DNA encoding the nucleic acid binding domain linked to the
HBEGF-encoding DNA. The construct is expressed in a suitable host cell and the
resulting protein tested for the ability to bind to HBEGF receptors and internalize. As
long as this ability is retained the mutein is suitable for use herein.

Methods for chemical conjugation of proteins are known to those of skill
in the art. The preferred methods for chemical conjugation depend on the selected
components, but preferably rely on disulfide bond formation. For example, if the
targeted agent is SPDP-derivatized saporin, then it is advantageous to dimerize the
VEGF moiety prior to coupling or conjugating to the derivatized saporin. If VEGF is
modified to include a cysteine residue at or near the N-, preferably, or C-terminus, then
dimerization should follow coupling to the nucleic acid binding domain. To effect
chemical conjugation herein, the HBEGF polypeptide is linked via one or more selected
linkers or directly to the nucleic acid binding domain.

b. Preparation of nucleic acid binding domains for chemical
conjugation

A nucleic acid binding domain is prepared for chemical conjugation. For
chemical conjugation, a nucleic acid binding domain may be derivatized with SPDP or
other suitable chemicals. If the binding domain does not have a Cys residue available
for reaction, one can be either inserted or substituted for another amino acid. If desired,
mono-derivatized species may be isolated, essentially as described.

For chemical conjugation, the nucleic acid binding domain may be
derivatized or modified such that it includes a cysteine residue for conjugation to the
receptor-binding internalized ligand. Typically, derivatization proceeds by reaction
with SPDP. This results in a heterogeneous population. For example, nucleic acid
binding domain that is derivatized by SPDP to a level of 0.9 moles pyridine-disulfide
per mole of nucleic acid binding domain includes a population of non-derivatized,
mono-derivatized and di-derivatized nucleic acid binding domain. Nucleic acid binding
domain proteins that are overly derivatized with SPDP may lose the ability to bind
nucleic acid because of reaction with sensitive lysines (Lambert et al., Cancer Treat.
Res. 57:175-209, 1988).
The quantity of non-derivatized nucleic acid binding domain in the preparation of the
non-purified material can be difficult to judge, and this may lead to errors in
estimating the correct proportion of derivatized nucleic acid binding domain to add to
the reaction mixture.
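
To give a feel for how mixed such a preparation can be, the following Python
sketch estimates the fractions of non-, mono- and di-derivatized protein at an
average of 0.9 moles of pyridine-disulfide per mole of protein. It assumes that
derivatization events are independent and follow a Poisson distribution; that
model is an idealization introduced here for illustration and is not stated in
this document, and the function name is hypothetical.

```python
import math


def poisson_fraction(mean_per_molecule, k):
    """Fraction of molecules carrying exactly k groups under a Poisson model."""
    return math.exp(-mean_per_molecule) * mean_per_molecule ** k / math.factorial(k)


mean = 0.9  # average moles of pyridine-disulfide per mole of protein (from the text)

# Assumed Poisson model: fractions with 0, 1 and 2 groups, plus the remainder.
fractions = {k: poisson_fraction(mean, k) for k in range(3)}
remainder = 1.0 - sum(fractions.values())

for k, fraction in fractions.items():
    print(f"{k} groups per molecule: {fraction:.3f}")
print(f"3 or more groups per molecule: {remainder:.3f}")
```

Under this assumption a substantial share of the protein carries no reactive
group at all, which is consistent with the difficulty noted above in judging how
much derivatized material to add to a coupling reaction.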

Because the reaction of SPDP with lysine removes a positive charge, the
three species nevertheless differ in charge. The methods herein
rely on this charge difference for purification of mono-derivatized nucleic acid binding
domain by Mono-S cation exchange chromatography. The use of purified mono-derivatized
nucleic acid binding domain has distinct advantages over the non-purified
material. The amount of receptor-binding internalized ligand that can react with nucleic
acid binding domain is limited to one molecule with the mono-derivatized material, and
it is seen in the results presented herein that a more homogeneous conjugate is
produced. There may still be sources of heterogeneity with the mono-derivatized
nucleic acid binding domain used here, but this is acceptable as long as binding to the
cytocide-encoding agent is not impacted.

Because more than one amino group on the nucleic acid binding domain 
may react with the succinimidyl moiety, it is possible that more than one amino group 
on the surface of the protein is reactive. This creates potential for heterogeneity in the 
mono-derivatized nucleic acid binding domain. 

As an alternative to derivatizing to introduce a sulfhydryl, the nucleic
acid binding domain can be modified by the introduction of a cysteine residue.
Preferred loci for introduction of a cysteine residue include the N-terminus region,
preferably within about one to twenty residues from the N-terminus of the nucleic acid
binding domain.

Using either methodology (reacting mono-derivatized nucleic acid 
binding domain or introducing a Cys residue into nucleic acid binding domain), the 
resulting preparations of chemical conjugates are monogenous; compositions containing 
the conjugates also appear to be free of aggregates. 



2. Fusion protein of receptor-binding internalized ligands and nucleic acid
binding domain



As a preferred alternative, heterogeneity can be avoided by producing a
fusion protein of receptor-binding internalized ligand and nucleic acid binding domain,
as described below. Expression of DNA encoding a fusion of a receptor-binding
internalized ligand polypeptide linked to the nucleic acid binding domain results in a
more homogeneous preparation of cytotoxic conjugates. Aggregate formation can be
reduced in preparations containing the fusion proteins by modifying the
receptor-binding internalized ligand, such as by removal of nonessential cysteines, and/or the
nucleic acid binding domain to prevent interactions between conjugates via free
cysteines. Optionally, one or more coding regions for endosome-disruptive peptide
may be constructed as part of the fusion protein.

DNA encoding the polypeptides may be isolated, synthesized or 
obtained from commercial sources or prepared as described herein. Expression of 
recombinant polypeptides may be performed as described herein; and DNA encoding
these polypeptides may be used as the starting materials for the methods herein. 

DNA encoding FGF, VEGF, HBEGF, hepatocyte
growth factor, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-8, IL-10, IL-11, IL-13, TNF,
GM-CSF, IFN and IGF polypeptides and/or the amino acid sequences of these factors are
described above. DNA may be prepared synthetically based on the amino acid or DNA
sequence or may be isolated using methods known to those of skill in the art, such as
PCR, probe hybridization of libraries, and the like or obtained from commercial or other
sources. For example, suitable methods are described in the Examples for amplifying
FGF encoding cDNA from plasmids containing FGF encoding cDNA.

As described herein, such DNA may then be mutagenized using standard
methodologies to delete or replace any cysteine residues that are responsible for
aggregate formation. If necessary, the identity of cysteine residues that contribute to
aggregate formation may be determined empirically, by deleting and/or replacing a
cysteine residue and ascertaining whether the resulting growth factor with the deleted
cysteine forms aggregates in solutions containing physiologically acceptable buffers
and salts. Loci for insertion of cysteine residues may also be determined empirically.
Generally, regions at or near (within 20, preferably 10 amino acids) the C- or,
preferably, the N-terminus are preferred.
The DNA construct encoding the fusion protein can be inserted into a
plasmid and expressed in a selected host, as described above, to produce a recombinant
receptor-binding internalized ligand/nucleic acid binding domain conjugate. Multiple
copies of the chimera can be inserted into a single plasmid in operative linkage with one
promoter. When expressed, the resulting protein will then be a multimer. Typically,
two to six copies of the chimera are inserted, preferably in a head to tail fashion, into
one plasmid.

a. Preparation of muteins for recombinant production of the fusion 
protein 

Removal of cysteines not required for binding and internalization is
preferred for both chemical conjugation and recombinant methods. In the chemical
conjugation methods, all except one cysteine, which is necessary for chemical
conjugation, are deleted or replaced. In practice, it appears that for FGF polypeptides
only two cysteines (including each of the cysteine residues set forth in Table 3), and
perhaps only the cysteines set forth in Table 3, are required for retention of the requisite
biological activity of the FGF peptide. Thus, FGF peptides that have more than two
cysteines are modified by replacing the remaining cysteines with serines. The resulting
muteins may be tested for the requisite biological activity.

FGF peptides, such as FGF-3, FGF-4 and FGF-6, that have two cysteines
can be modified by replacing the second cysteine, which is not listed in Table 3, and the
resulting mutein used as part of a construct containing DNA encoding the cytotoxic
agent linked to the FGF-encoding DNA. The construct is expressed in a suitable host
cell and the resulting protein tested for the ability to bind to FGF receptors and
internalize the cytotoxic agent. As exemplified herein, conjugates containing bFGF
muteins in which Cys78 and Cys96 have been replaced with serine residues have been
prepared.



b. DNA constructs and expression of the DNA constructs
To produce monogenous preparations of fusion protein, DNA encoding 
the FGF protein or other receptor-binding internalized ligand is modified so that, upon 
expression, the resulting FGF portion of the fusion protein does not include any 
cysteines available for reaction. In preferred embodiments, DNA encoding an FGF 
polypeptide is linked to DNA encoding a nucleic acid binding domain. The DNA 
encoding the FGF polypeptide or other receptor-binding internalized ligand is modified 
in order to remove the translation stop codon and other transcriptional or translational 
stop signals that may be present and to remove or replace DNA encoding the available 
cysteines. The DNA is then ligated to the DNA encoding the nucleic acid binding 
domain polypeptide directly or via a linker region of one or more codons between the 
first codon of the nucleic acid binding domain and the last codon of the FGF. The size 
of the linker region may be any length as long as the resulting conjugate binds and is 
internalized by a target cell. Presently, spacer regions of from about one to about 
seventy-five to ninety codons are preferred. The order of the receptor-binding
internalized ligand and nucleic acid binding domain in the fusion protein may be 
reversed. If the nucleic acid binding domain is N-terminal, then it is modified to 
remove the stop codon and any stop signals. 
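
The construct assembly described above (remove the stop codon of the
N-terminal partner, optionally add a linker of whole codons, then append the
C-terminal partner in frame) can be sketched as follows. This is a minimal
illustration under stated assumptions: the function names and the short toy
sequences are hypothetical, do not correspond to any SEQ ID NO. in this
document, and the sketch ignores cloning details such as restriction sites.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_terminal_stop(orf):
    """Return the coding sequence without a trailing stop codon, if present."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return orf[:-3] if orf[-3:] in STOP_CODONS else orf


def build_fusion(n_terminal_orf, c_terminal_orf, linker=""):
    """Join two coding sequences in frame, with an optional codon linker."""
    linker = linker.upper()
    c_terminal_orf = c_terminal_orf.upper()
    if len(linker) % 3 != 0 or len(c_terminal_orf) % 3 != 0:
        raise ValueError("linker and C-terminal sequence must be whole codons")
    upstream = strip_terminal_stop(n_terminal_orf) + linker
    # An in-frame stop upstream of the second partner would truncate the fusion.
    for i in range(0, len(upstream), 3):
        if upstream[i:i + 3] in STOP_CODONS:
            raise ValueError(f"in-frame stop codon at nucleotide {i + 1}")
    return upstream + c_terminal_orf


# Toy sequences for illustration only (not real ligand or binding-domain DNA).
ligand_orf = "ATGGCTTGCTAA"          # Met-Ala-Cys-stop
binding_domain_orf = "ATGCGTCGTTAA"  # Met-Arg-Arg-stop
fusion = build_fusion(ligand_orf, binding_domain_orf, linker="GGTGGT")
print(fusion)
```

Swapping the first two arguments models the reversed order mentioned above, in
which the nucleic acid binding domain is N-terminal and loses its stop codon.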
As discussed above, any heparin-binding protein, including FGF, VEGF,
HBEGF, cytokine, growth factor and the like may be modified and expressed in accord
with the methods herein. Binding to an FGF receptor followed by internalization are
the only activities required for an FGF protein to be suitable for use herein. All of the
FGF proteins induce mitogenic activity in a wide variety of normal diploid
mesoderm-derived and neural crest-derived cells and this activity is mediated by binding to an FGF
cell surface receptor followed by internalization. A test of such "FGF mitogenic
activity", which reflects the ability to bind to FGF receptors and to be internalized, is
the ability to stimulate proliferation of cultured bovine aortic endothelial cells (see, e.g.,
Gospodarowicz et al., J. Biol. Chem. 257:12266-12278, 1982; Gospodarowicz et al.,
Proc. Natl. Acad. Sci. USA 73:4120-4124, 1976).

If the FGF or other ligand has been modified so as to lack mitogenic
activity or other biological activities, binding and internalization may still be readily
assayed by any one of the following tests or other equivalent tests. Generally, these
tests involve labeling the ligand, incubating it with target cells, and visualizing or
measuring intracellular label. For example, briefly, FGF may be fluorescently labeled
with FITC or radiolabeled with ^^H. Fluorescein-conjugated FGF is incubated with
cells and examined microscopically by fluorescence microscopy or confocal
microscopy for internalization. When FGF is labeled with ^^H, the labeled FGF is
incubated with cells at 4°C. Cells are temperature shifted to 37°C and washed with 2 M
NaCl at low pH to remove any cell-bound FGF. Label is then counted, thereby
measuring internalization of FGF. Alternatively, the ligand can be conjugated with a
nucleic acid binding domain by any of the methods described herein and complexed
with a plasmid encoding saporin. As discussed below, the complex may be used to
transfect cells and cytotoxicity measured.
The DNA encoding the resulting receptor-binding internalized
ligand-nucleic acid binding domain can be inserted into a plasmid and expressed in a
selected host, as described above, to produce a monogenous preparation. Fusion
proteins of FGF-2 and protamine are especially suitable for use in the present invention.
Multiple copies of the modified receptor-binding internalized
ligand/nucleic acid binding domain chimera can be inserted into a single plasmid in
operative linkage with one promoter. When expressed, the resulting protein will be a
multimer. Typically two to six copies of the chimera are inserted, preferably in a head
to tail fashion, into one plasmid.

By way of example, DNA encoding human bFGF-SAP having
SEQ ID NO. 52 has been mutagenized as described in the Examples using splicing by
overlap extension (SOE). Another preferred coding region is set forth in SEQ ID
NO. 53. In both instances, in preferred embodiments, the DNA is modified by
replacing the cysteines at positions 78 and 96 with serine. The codons encoding
cysteine residues at positions 78 and 96 of FGF were converted to serine codons by
SOE. Each application of the SOE method uses two amplified oligonucleotide
products, which have complementary ends as primers and which include an altered
codon at the locus at which the mutation is desired, to produce a hybrid product. A
second amplification reaction that uses two primers that anneal at the non-overlapping
ends amplifies the hybrid to produce DNA that has the desired alteration.
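
The net sequence change produced by such a cysteine-to-serine mutagenesis can
be illustrated with a short Python sketch. It models only the resulting codon
substitution, not the SOE amplification steps themselves. The function name, the
choice of serine codons (the single-base changes TGT to TCT and TGC to TCC) and
the toy sequence are assumptions made for illustration, and the residue
numbering here simply counts codons from the first codon of the supplied
sequence.

```python
# Single-base changes that turn each cysteine codon into a serine codon.
CYS_TO_SER = {"TGT": "TCT", "TGC": "TCC"}


def cys_to_ser(orf, residue_number):
    """Replace the cysteine codon at a 1-based residue position with serine."""
    orf = orf.upper()
    if residue_number < 1:
        raise ValueError("residue_number is 1-based")
    start = 3 * (residue_number - 1)
    codon = orf[start:start + 3]
    if codon not in CYS_TO_SER:
        raise ValueError(
            f"residue {residue_number} is encoded by {codon!r}, not a cysteine codon"
        )
    return orf[:start] + CYS_TO_SER[codon] + orf[start + 3:]


# Toy coding sequence for illustration only: Met-Cys-Ala-Cys-stop.
toy_orf = "ATGTGTGCTTGCTAA"
single_mutant = cys_to_ser(toy_orf, 2)
double_mutant = cys_to_ser(single_mutant, 4)
print(single_mutant)
print(double_mutant)
```

Applying the function twice, as in the last lines, mirrors the preparation of a
double mutant in which two cysteine codons are converted in succession.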



3. Binding of the receptor-binding internalized ligand/nucleic acid binding
domain conjugate to cytocide-encoding agents

The receptor-binding internalized ligand/nucleic acid binding domain is
incubated with the cytocide-encoding agent, preferably a linear DNA molecule, to be
delivered under conditions that allow binding of the nucleic acid binding domain to the
agent. Conditions will vary somewhat depending on the nature of the nucleic acid
binding domain, but binding will typically occur in 0.1 M NaCl and 20 mM HEPES or other
similar buffer. Alternatively, salt conditions can be varied to increase the packing or
condensation of DNA. The extent of binding is preferably tested for each preparation.
After complexing, additional nucleic acid binding domain, such as poly-L-lysine, may
be added to further condense the nucleic acid.

Merely by way of example, test constructs have been made and tested.
One construct is a chemical conjugate of bFGF and poly-L-lysine. The bFGF molecule
is a variant in which the Cys residue at position 96 has been changed to a serine; thus,
only the Cys at position 78 is available for conjugation. This bFGF is called FGF2-3.
The poly-L-lysine was derivatized with SPDP and coupled to FGF2-3. This
FGF2-3/poly-L-lysine conjugate was used to deliver a plasmid able to express the
β-galactosidase gene.

The ability of a construct to bind nucleic acid molecules may be
conveniently assessed by agarose gel electrophoresis. Briefly, a plasmid, such as pSVβ,
is digested with restriction enzymes to yield a variety of fragment sizes. For ease of
detection, the fragments may be labeled with ³²P either by filling in of the ends with
DNA polymerase I or by phosphorylation of the 5'-end with polynucleotide kinase
following dephosphorylation by alkaline phosphatase. The plasmid fragments are then
incubated with the receptor-binding internalized ligand/nucleic acid binding domain (in
this case, FGF2-3/poly-L-lysine) in a buffered saline solution, such as 20 mM HEPES,
pH 7.3, 0.1 M NaCl. The reaction mixture is electrophoresed on an agarose gel
alongside similarly digested, but nonreacted fragments. If a radioactive label was
incorporated, the gel may be dried and autoradiographed. If no radioactive label is
present, the gel may be stained with ethidium bromide and the DNA visualized through
appropriate red filters after excitation with UV. Binding has occurred if the mobility of
the fragments is retarded compared to the control. In the example case, the mobility of
the fragments was retarded after binding with the FGF2-3/poly-L-lysine conjugate. If
there is insufficient binding, additional poly-L-lysine may be added until binding is
observed.

Further testing of the conjugate is performed to show that it binds to the 
cell surface receptor and is internalized into the cell. It is not necessary that the 
receptor-binding internalized ligand part of the conjugate retain complete biological 
activity. For example, FGF is mitogenic on certain cell types. As discussed above, this 
activity may not always be desirable. If this activity is present, a proliferation assay is 
performed. Likewise, for each desirable activity, an appropriate assay may be 
performed. However, for application of the subject invention, the only criteria that need 
be met are receptor binding and internalization. 

Receptor binding and internalization may be measured by the following 
three assays. (1) A competitive inhibition assay of the complex to cells expressing the 
appropriate receptor demonstrates receptor binding. (2) Receptor binding and 
internalization may be assayed by measuring expression of a reporter gene, such as 
β-gal (e.g., enzymatic activity), in cells that have been transformed with a complex of a 
plasmid encoding a reporter gene and a conjugate of a receptor-binding internalized 
ligand and nucleic acid binding domain. This assay is particularly useful for optimizing 
conditions to give maximal transformation. Thus, the optimum ratio of receptor- 
binding internalized ligand/nucleic acid binding domain to nucleic acid and the amount 
of DNA per cell may readily be determined by assaying and comparing the enzymatic 
activity of β-gal. As such, these first two assays are useful for preliminary analysis and 
failure to show receptor binding or β-gal activity does not per se eliminate a candidate 
receptor-binding internalized ligand/nucleic acid binding domain conjugate or fusion 
protein from further analysis. (3) The preferred assay is a cytotoxicity assay performed 
on cells transformed with a cytocide-encoding agent bound by receptor-binding 
internalized ligand/nucleic acid binding domain. While, in general, any cytocidal 
molecule may be used, ribosome inactivating proteins are preferred and saporin, or 
another type I ribosome inactivating protein, is particularly preferred. A statistically 
significant reduction in cell number demonstrates the ability of the receptor-binding 
internalized ligand/nucleic acid binding domain conjugate or fusion to deliver nucleic 
acids into a cell. 
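
The comparison of β-gal activity across conditions described for assay (2) can be sketched as follows. This is only an illustrative helper, assuming replicate activity readings have already been collected for each conjugate-to-nucleic-acid ratio; the function name, the ratio values, and the readings are hypothetical and are not data from this specification.

```python
from statistics import mean


def best_ratio(activity_by_ratio):
    """Return (ratio, mean activity) for the ratio with the highest mean
    reporter activity across its replicate readings."""
    means = {ratio: mean(values) for ratio, values in activity_by_ratio.items()}
    ratio = max(means, key=means.get)
    return ratio, means[ratio]


# Hypothetical replicate readings (arbitrary units), keyed by the ratio of
# conjugate to nucleic acid used to form the complex.
readings = {
    0.5: [10.0, 12.0],
    1.0: [30.0, 34.0],
    2.0: [55.0, 61.0],
    4.0: [40.0, 38.0],
}

if __name__ == "__main__":
    ratio, activity = best_ratio(readings)
    print(f"highest mean activity {activity:.1f} at ratio {ratio}")
```

The same comparison can be repeated with the amount of DNA per cell as the varied condition.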



Conjugation of ligand to nucleic acid and binding to nucleic acid 
domain 

As an alternative, the receptor-binding internalized ligand may be 
conjugated to the nucleic acid, either directly or through a linker. Methods for 
conjugating nucleic acids, at the 5' ends, 3' ends and elsewhere, to the amino and 
carboxyl termini and other sites in proteins are known to those of skill in the art (for a 
review see, e.g., Goodchild, (1993) In: Perspectives in Bioconjugate Chemistry, Mears, 
Ed., American Chemical Society, Washington, D.C. pp. 77-99). For example, proteins 
have been linked to nucleic acids using ultraviolet irradiation (Sperling et al. (1978) 
Nucleic Acids Res. 5:2755-2773; Fiser et al. (1975) FEBS Lett. 52:281-283), 
bifunctional chemicals (Baumert et al. (1978) Eur. J. Biochem. 89:353-359; and Oste et 
al. (1979) Mol. Gen. Genet. 168:81-86) and photochemical cross-linking (Vanin et al. 
(1981) FEBS Lett. 124:89-92; Rinke et al. (1980) J. Mol. Biol. 137:301-314; Millon et 
al. (1980) Eur. J. Biochem. 110:485-454). 

In particular, the reagents N-acetyl-N'-(p-glyoxylylbenzoyl)cystamine 
and 2-iminothiolane have been used to couple DNA to proteins, such as 
α2-macroglobulin (α2M), via mixed disulfide formation (see Cheng et al., Nucleic Acids 
Res. 11:659-669, 1983). N-acetyl-N'-(p-glyoxylylbenzoyl)cystamine reacts 
specifically with nonpaired guanine residues and, upon reduction, generates a free 
sulfhydryl group. 2-iminothiolane reacts with proteins to generate sulfhydryl groups 
that are then conjugated to the derivatized DNA by an intermolecular disulfide 
interchange reaction. Any linkage may be used provided that the targeted nucleic acid 
is active upon internalization of the conjugate. Thus, it is expected that cleavage of the 
linkage may be necessary, although it is contemplated that for some reagents, such as 
DNA encoding ribozymes linked to promoters or DNA encoding therapeutic agents for 
delivery to the nucleus, such cleavage may not be necessary. 

Thiol linkages, which are preferred, can be readily formed using 
heterobifunctional reagents. Amines have also been attached to the terminal 5' 
phosphate of unprotected oligonucleotides or nucleic acids in aqueous solutions by 
reacting the nucleic acid with a water-soluble carbodiimide, such as 
1-ethyl-3-[3-dimethylaminopropyl]carbodiimide (EDC) or 
N-ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDCI), in imidazole 
buffer at pH 6 to produce the 
5'-phosphorimidazolide. Contacting the 5'-phosphorimidazolide with amine-containing 
molecules, such as an FGF, and ethylenediamine, results in stable phosphoramidates 
(see, e.g., Chu et al., Nucleic Acids Res. 11:6513-6529, 1983; and WO 88/05077). In 
particular, a solution of DNA is saturated with EDC at pH 6 and incubated with 
agitation at 4°C overnight. The resulting solution is then buffered to pH 8.5 by adding, 
for example, about 3 volumes of 100 mM citrate buffer, and adding about 5 µg to about 20 
µg of an FGF, and agitating the resulting mixture at 4°C for about 48 hours. The 
unreacted protein may be removed from the mixture by column chromatography using, 
for example, Sephadex G75 (Pharmacia) with 0.1 M ammonium carbonate solution, 
pH 7.0, as an eluting buffer. The isolated conjugate may be lyophilized and stored until 
used. 

U.S. Patent No. 5,237,016 provides methods for preparing nucleotides 
that are bromoacetylated at their 5' termini and reacting the resulting oligonucleotides 
with thiol groups. Oligonucleotides derivatized at their 5'-termini with bromoacetyl groups 
can be prepared by reacting 5'-aminohexyl-phosphoramidate oligonucleotides with 
bromoacetic acid-N-hydroxysuccinimide ester as described in U.S. Patent 
No. 5,237,016. This patent also describes methods for preparing thiol-derivatized 
nucleotides, which can then be reacted with thiol groups on the selected growth factor. 
Briefly, thiol-derivatized nucleotides are prepared using a 5'-phosphorylated nucleotide 
in two steps: (1) reaction of the phosphate group with imidazole in the presence of a 
diimide and displacement of the imidazole leaving group with cystamine in one reaction 
step; and (2) reduction of the disulfide bond of the cystamine linker with dithiothreitol (see 
also, Orgel et al. (1986) Nucl. Acids Res. 14:651, which describes a similar procedure). 
The 5'-phosphorylated starting oligonucleotides can be prepared by methods known to 
those of skill in the art (see, e.g., Maniatis et al. (1982) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York, p. 122). 

The nucleic acid, such as a methylphosphonate oligonucleotide 
(MP-oligomer), may be derivatized by reaction with SPDP or SMPB. The resulting MP- 
oligomer may be purified by HPLC and then coupled to an FGF, such as an FGF or 
FGF mutein, modified by replacement of one or more cysteine residues, as described 
above. The MP-oligomer (about 0.1 µM) is dissolved in about 40-50 µl of 1:1 
acetonitrile/water to which phosphate buffer (pH 7.5, final concentration 0.1 M) and a 1 
mg MP-oligomer in about 1 ml phosphate buffered saline is added. The reaction is 
allowed to proceed for about 5-10 hours at room temperature and is then quenched with 
about 15 µl 0.1 iodoacetamide. FGF-oligonucleotide conjugates can be purified on 
heparin sepharose HiTrap columns (1 ml, Pharmacia) and eluted with a linear or step 
gradient. The conjugate should elute in 0.6 M NaCl. 

The ligand may be conjugated to the nucleic acid construct encoding the 
cytocide or cytotoxic agent or may be conjugated to a mixture of oligonucleotides 
complementary to one strand of the construct. The oligonucleotides are then added to 
single stranded construct produced by melting a double-stranded construct or grown and 
isolated as single-stranded. As a general guideline, the oligonucleotides should 
hybridize at a higher temperature than the construct alone, if a double-stranded 
construct is used as the starting material. The gaps are filled in by DNA polymerase I to 
generate a construct with one strand conjugated to ligand and one strand unconjugated. 
Oligonucleotides conjugated to ligand and complementary to the other strand may be 
used in addition to generate a mixture of constructs with different strands linked to 
ligand. Any remaining single stranded plasmid may be digested with a single strand 
specific endonuclease. The ligand-conjugated constructs are then mixed with a nucleic 
acid binding domain, such as protamine or polylysine, to effect condensation of the 
construct for delivery. Optimal ratios of ligand to DNA may be determined 
experimentally by receptor-mediated transfection of a construct containing a reporter 
gene. 



J- Formulation and administration of pharmaceutical compositions 

The conjugates and complexes provided herein are useful in the 
treatment and prevention of various diseases, syndromes, and hyperproliferative 
disorders. As used herein, "treatment" means any manner in which the symptoms of a 
condition, disorder or disease are ameliorated or otherwise beneficially altered. 
Treatment also encompasses any pharmaceutical use of the compositions herein. As 
used herein, "amelioration" of the symptoms of a particular disorder by administration 
of a particular pharmaceutical composition refers to any lessening, whether permanent 
or temporary, lasting or transient that can be attributed to or associated with 
administration of the composition. For example, these conjugates and complexes may 
be used to treat complications of the eye following laser surgery, glaucoma surgery, and 
removal of pterygii. Following these treatments, reoccurrence of the problem often 
ensues due to proliferation of cells in the cornea or eye. The conjugates and complexes 
inhibit the proliferation of these cells. The conjugates and complexes may be used in 
general to treat pathophysiological conditions, especially FGF-, VEGF-, or 
HBEGF-mediated pathophysiological conditions by specifically targeting to cells 
having corresponding receptors. 

As used herein, "FGF-mediated pathophysiological condition" refers to a 
deleterious condition characterized by or caused by proliferation of cells that are 
sensitive to FGF mitogenic stimulation. Basic FGF-mediated pathophysiological 
conditions include, but are not limited to, melanoma, other tumors, rheumatoid arthritis, 
restenosis, Dupuytren's Contracture and certain complications of diabetes, such as 
proliferative retinopathy. 

As used herein, "HBEGF-mediated pathophysiological condition" refers 
to a deleterious condition characterized by or caused by proliferation of cells that are 
sensitive to HBEGF mitogenic stimulation. HBEGF-mediated pathophysiological 
conditions include conditions involving pathophysiological proliferation of smooth 
muscle cells, such as restenosis, certain tumors, such as solid tumors including breast 
and bladder tumors, tumors involving pathophysiological expression of EGF receptors, 
dermatological disorders, such as psoriasis, and ophthalmic disorders involving 
epithelial cells, such as recurrence of pterygii and secondary lens clouding. 

Similarly, tumors and hyperproliferating cells expressing cytokine 
receptors or growth factor receptors may be eliminated. Such diseases include 
restenosis, Dupuytren's Contracture, diabetic retinopathies, rheumatoid arthritis, 
Kaposi's sarcoma, lymphomas, leukemias, tumors such as renal cell carcinoma, colon 
carcinoma, breast cancer, bladder cancer, and disorders with underlying vascular 
proliferation, such as diseases in the back of the eye (e.g., proliferative 
vitreoretinopathy, macular degeneration and diabetic retinopathy). For treatment of the 
back of the eye especially, use of the VEGF-receptor promoter to control expression of 
the cytocide or cytotoxic agent is preferred. The conjugates may be used to prevent 
corneal haze or clouding that results from exposure of the cornea to laser radiation 
during eye surgery, particularly LRK. The haze or clouding appears to result from 
fibroblastic keratocyte proliferation in the subepithelial zone following photoablation of 
the cornea. 

The conjugates may be used to treat a "hyperproliferative skin disorder." 
As used herein, it is a disorder that is manifested by a proliferation of endothelial cells 
of the skin coupled with an underlying vascular proliferation, resulting in a localized 
patch of scaly or horny or thickened skin or a tumor of endothelial origin. Such 
disorders include actinic and atopic dermatitis, toxic eczema, allergic eczema, psoriasis, 
skin cancers and other tumors, such as Kaposi's sarcoma, angiosarcoma, hemangiomas, 
and other highly vascularized tumors, and vascular proliferative responses, such as 
varicose veins. 

As well, the conjugates may be used to treat or prevent restenosis, a 
process and the resulting condition that occurs following angioplasty in which the 
arteries become reclogged. After treatment of arteries by balloon catheter or other such 
device, denudation of the interior wall of the vessel occurs, including removal of the 
endothelial cells that constitute the lining of the blood vessels. As a result of this 
removal and the concomitant vascular injury, smooth muscle cells (SMCs), which form 
the blood vessel structure, proliferate and fill the interior of the blood vessel. This 
process and the resulting condition is restenosis. 

Pharmaceutical carriers or vehicles suitable for administration of the 
conjugates and complexes provided herein include any such carriers known to those 
skilled in the art to be suitable for the particular mode of administration. In addition, 
the conjugates and complexes may be formulated as the sole pharmaceutically active 
ingredient in the composition or may be combined with other active ingredients. 

The conjugates and complexes can be administered by any appropriate 
route, for example, orally, parenterally, including intravenously, intradermally, 
subcutaneously, or topically, in liquid, semi-liquid or solid form and are formulated in a 
manner suitable for each route of administration. Preferred modes of administration 
depend upon the indication treated. Dermatological and ophthalmologic indications 
will typically be treated locally, whereas tumors and restenosis will typically be treated 
by systemic, intradermal, or intramuscular modes of administration. 

The conjugates and complexes herein may be formulated into 
pharmaceutical compositions suitable for topical, local, intravenous and systemic 
application. For the ophthalmic uses herein, local administration, either by topical 
administration or by injection, is preferred. Time release formulations are also desirable. 
Effective concentrations of one or more of the conjugates and complexes are mixed 
with a suitable pharmaceutical carrier or vehicle. As used herein, an "effective amount" 
of a compound for treating a particular disease is an amount that is sufficient to 
ameliorate, or in some manner reduce, the symptoms associated with the disease. Such 
amount may be administered as a single dosage or may be administered according to a 
regimen, whereby it is effective. The amount may cure the disease but, typically, is 
administered in order to ameliorate the symptoms of the disease. Repeated 
administration may be required to achieve the desired amelioration of symptoms. 

As used herein, "an ophthalmically effective amount" is that amount 
which, in the composition administered and by the technique administered, provides an 
amount of therapeutic agent to the involved eye tissues sufficient to prevent or reduce 
corneal haze following excimer laser surgery, prevent closure of a trabeculectomy, 
prevent or substantially slow the recurrence of pterygii, and other conditions. 

The concentrations or amounts of the conjugates and complexes that are 
effective require delivery of an amount, upon administration, that ameliorates the 
symptoms or treats the disease. Typically, the compositions are formulated for single 
dosage administration. Therapeutically effective concentrations and amounts may be 
determined empirically by testing the conjugates and complexes in known in vitro and 
in vivo systems, such as those described here; dosages for humans or other animals may 
then be extrapolated therefrom. 

The conjugate is included in the pharmaceutically acceptable carrier in 
an amount sufficient to exert a therapeutically useful effect in the absence of 
undesirable side effects on the patient treated. The conjugates may be delivered as 
pharmaceutically acceptable salts, esters or other derivatives of the conjugates, which include 
any salts, esters or derivatives that may be readily prepared by those of skill in this art 
using known methods for such derivatization and that produce compounds that may be 
administered to animals or humans without substantial toxic effects. It is understood 
that the number and degree of side effects depends upon the condition for which the 
conjugates and complexes are administered. For example, certain toxic and undesirable 
side effects are tolerated when treating life-threatening illnesses, such as tumors, that 
would not be tolerated when treating disorders of lesser consequence. The 
concentration of conjugate in the composition will depend on absorption, inactivation 
and excretion rates thereof, the dosage schedule, and amount administered as well as 
other factors known to those of skill in the art. 

Preferably, the conjugate and complex are substantially pure. As used 
herein, "substantially pure" means sufficiently homogeneous to appear free of readily 
detectable impurities as determined by standard methods of analysis, such as thin layer 
chromatography (TLC), gel electrophoresis, high performance liquid chromatography 
(HPLC), used by those of skill in the art to assess such purity, or sufficiently pure such 
that further purification would not detectably alter the physical and chemical properties, 
such as enzymatic and biological activities, of the substance. Methods for purification 
of the compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound may, however, be a 
mixture of stereoisomers. In such instances, further purification might increase the 
specific activity of the compound. 

Typically a therapeutically effective dosage should produce a serum 
concentration of active ingredient of from about 0.1 ng/ml to about 50-100 µg/ml. The 
pharmaceutical compositions typically should provide a dosage of from about 0.01 mg 
to about 100 - 2000 mg of conjugate, depending upon the conjugate selected, per 
kilogram of body weight per day. For example, for treatment of restenosis a daily 
dosage of between about 0.05 and 0.5 mg/kg (based on FGF-SAP chemical conjugate or 
an amount of conjugate provided herein equivalent on a molar basis thereto) should be 
sufficient. Local application for ophthalmic disorders and dermatological disorders 
should provide about 1 ng up to 100 µg, preferably about 1 ng to [...] 
single dosage administration. It is understood that the amount to administer will be a 
function of the conjugate selected, the indication treated, and possibly the side effects 
that will be tolerated. 
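
The per-kilogram figures given above for restenosis (about 0.05 to 0.5 mg/kg per day) translate into a total daily amount by simple multiplication. The sketch below shows only that arithmetic; the function name and the 70 kg body weight are illustrative assumptions, and the output is not a dosing recommendation.

```python
def daily_dose_range_mg(body_weight_kg, low_mg_per_kg=0.05, high_mg_per_kg=0.5):
    """Return (low, high) total daily amounts in mg for a given body weight,
    using the per-kilogram range quoted in the text for restenosis."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg


if __name__ == "__main__":
    low, high = daily_dose_range_mg(70.0)  # 70 kg is a hypothetical example
    print(f"{low:.2f} mg to {high:.2f} mg per day")
```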

Therapeutically effective concentrations and amounts may be determined 
for each application herein empirically by testing the conjugates and complexes in 
known in vitro and in vivo systems (e.g., murine, rat, rabbit, or baboon models), such as 
those described herein; dosages for humans or other animals may then be extrapolated 
therefrom. Demonstration that the conjugates and complexes prevent or inhibit 
proliferation of serum stimulated corneal keratocytes or fibroblasts explanted from eyes, 
as shown herein, and demonstration of any inhibition of proliferation of such tissues in 
rabbits should establish human efficacy. The rabbit eye model is a recognized model 
for studying the effects of topically and locally applied drugs (see, U.S. Patent Nos. 
[illegible], 5,262,178, 5,256,408, 5,252,319, 5,238,925, 5,165,952; see also 
Mirate et al., Curr. Eye Res. 1:491-493, 1981). 

The active ingredient may be administered at once, or may be divided 
into a number of smaller doses to be administered at intervals of time. It is understood 
that the precise dosage and duration of treatment is a function of the disease being 
treated and may be determined empirically using known testing protocols or by 
extrapolation from in vivo or in vitro test data. It is to be noted that concentrations and 
dosage values may also vary with the severity of the condition to be alleviated. It is to 
be further understood that for any particular subject, specific dosage regimens should be 
adjusted over time according to the individual need and the professional judgment of 
the person administering or supervising the administration of the compositions, and that 
the concentration ranges set forth herein are exemplary only and are not intended to 
limit the scope or practice of the claimed compositions. 

The conjugates and complexes may be formulated for local or topical 
application, such as for topical application to the skin and mucous membranes, such as 
in the eye, in the form of gels, creams, and lotions and for application to the eye or for 
intracisternal or intraspinal application. Such solutions, particularly those intended for 
ophthalmic use, may be formulated as 0.01% -10% isotonic solutions, pH about 5-7, 
with appropriate salts. The ophthalmic compositions may also include additional 
components, such as hyaluronic acid. The conjugates and complexes may be 
formulated as aerosols for topical application (see, e.g., U.S. Patent Nos. 4,044,126, 
4,414,209, and 4,364,923). 

Solutions or suspensions used for parenteral, intradermal, subcutaneous, 
or topical application can include any of the following components: a sterile diluent, 
such as water for injection, saline solution, fixed oil, polyethylene glycol, glycerine, 
propylene glycol or other synthetic solvent; antimicrobial agents, such as benzyl alcohol 
and methyl parabens; antioxidants, such as ascorbic acid and sodium bisulfite; chelating 
agents, such as ethylenediaminetetraacetic acid (EDTA); buffers, such as acetates, 
citrates and phosphates; and agents for the adjustment of tonicity such as sodium 
chloride or dextrose. Parenteral preparations can be enclosed in ampules, disposable 
syringes or multiple dose vials made of glass, plastic or other suitable material. 

If administered intravenously, suitable carriers include physiological 
saline or phosphate buffered saline (PBS), and solutions containing thickening and 
solubilizing agents, such as glucose, polyethylene glycol, and polypropylene glycol and 
mixtures thereof. Liposomal suspensions may also be suitable as pharmaceutically 
acceptable carriers. These may be prepared according to methods known to those 
skilled in the art. 

Upon mixing or addition of the conjugate(s) with the vehicle, the 
resulting mixture may be a solution, suspension, emulsion or the like. The form of the 
resulting mixture depends upon a number of factors, including the intended mode of 
administration and the solubility of the conjugate in the selected carrier or vehicle. The 
effective concentration is sufficient for ameliorating the symptoms of the disease, 
disorder or condition treated and may be empirically determined based upon in vitro 
and/or in vivo data, such as the data from the mouse xenograft model for tumors or 
rabbit ophthalmic model. If necessary, pharmaceutically acceptable salts or other 
derivatives of the conjugates and complexes may be prepared. 
The active materials can also be mixed with other active materials that 
do not impair the desired action, or with materials that supplement the desired action, 
including viscoelastic materials, such as hyaluronic acid, which is sold under the 
trademark HEALON (solution of a high molecular weight (MW of about 3 million) 
fraction of sodium hyaluronate; manufactured by Pharmacia, Inc.; see, e.g., U.S. Patent 
Nos. 5,292,362, 5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803), VISCOAT 
(fluorine-containing (meth)acrylates, such as 1H,1H,2H,2H-hepta- 
decafluorodecylmethacrylate; see, e.g., U.S. Patent Nos. 5,278,126, 5,273,751, and 
5,214,080; commercially available from Alcon Surgical, Inc.), ORCOLON (see, e.g., 
U.S. Patent No. 5,273,056; commercially available from Optical Radiation 
Corporation), methylcellulose, methyl hyaluronate, polyacrylamide and 
polymethacrylamide (see, e.g., U.S. Patent No. 5,273,751). The viscoelastic materials 
are present generally in amounts ranging from about 0.5 to 5.0%, preferably 1 to 3% by 
weight of the conjugate material and serve to coat and protect the treated tissues. The 
compositions may also include a dye, such as methylene blue or other inert dye, so that 
the composition can be seen when injected into the eye or contacted with the surgical 
site during surgery. 

The conjugates and complexes may be formulated for local or topical 
application, such as for topical application to the skin and mucous membranes, such as 
in the eye, in the form of gels, creams, and lotions and for application to the eye. Such 
solutions, particularly those intended for ophthalmic use, may be formulated as 0.01%- 
10% isotonic solutions, pH about 5-7, with appropriate salts. Suitable ophthalmic 
solutions are known (see, e.g., U.S. Patent No. 5,116,868, which describes typical 
compositions of ophthalmic irrigation solutions and solutions for topical application). 
Such solutions, which have a pH adjusted to about 7.4, contain, for example, 90-100 mM 
sodium chloride, 4-6 mM dibasic potassium phosphate, 4-6 mM dibasic sodium 
phosphate, 8-12 mM sodium citrate, 0.5-1.5 mM magnesium chloride, 1.5-2.5 mM 
calcium chloride, 15-25 mM sodium acetate, 10-20 mM D,L-sodium β- 
hydroxybutyrate and 5-5.5 mM glucose. 
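
The example composition ranges listed above can be written down as a small lookup table and used to check a candidate formulation. This is only a sketch: the ranges are transcribed from the paragraph above as best it can be read (several values are damaged in the source text, so they should be treated as assumptions), and the names `OPHTHALMIC_RANGES_MM` and `out_of_range` are hypothetical.

```python
# Concentration ranges in mM, as read from the paragraph above (assumed values
# where the source text is damaged).
OPHTHALMIC_RANGES_MM = {
    "sodium chloride": (90.0, 100.0),
    "dibasic potassium phosphate": (4.0, 6.0),
    "dibasic sodium phosphate": (4.0, 6.0),
    "sodium citrate": (8.0, 12.0),
    "magnesium chloride": (0.5, 1.5),
    "calcium chloride": (1.5, 2.5),
    "sodium acetate": (15.0, 25.0),
    "D,L-sodium beta-hydroxybutyrate": (10.0, 20.0),
    "glucose": (5.0, 5.5),
}


def out_of_range(formulation_mm):
    """Return the names of components that are missing from the formulation
    or whose concentration (mM) falls outside the listed range."""
    problems = []
    for name, (low, high) in OPHTHALMIC_RANGES_MM.items():
        value = formulation_mm.get(name)
        if value is None or not (low <= value <= high):
            problems.append(name)
    return problems


# A hypothetical formulation at the midpoint of every range.
midpoint = {name: (low + high) / 2 for name, (low, high) in OPHTHALMIC_RANGES_MM.items()}

if __name__ == "__main__":
    print(out_of_range(midpoint))
```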
The conjugates and complexes may be prepared with carriers that protect 
them against rapid elimination from the body, such as time release formulations or 
coatings. Such carriers include controlled release formulations, such as, but not limited 
to, implants and microencapsulated delivery systems, and biodegradable, biocompatible 
polymers, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
polyorthoesters, polylactic acid and others. For example, the composition may be 
applied during surgery using a sponge, such as a commercially available surgical 
sponge (see, e.g., U.S. Patent Nos. 3,956,044 and 4,045,238; available from Weck, 
Alcon, and Mentor), that has been soaked in the composition and that releases the 
composition upon contact with the eye. These are particularly useful for application to 
the eye for ophthalmic indications following or during surgery in which only a single 
administration is possible. The compositions may also be applied in pellets (such as 
Elvax pellets (ethylene-vinyl acetate copolymer resin); about 1-5 µg of conjugate per 1 
mg resin) that can be implanted in the eye during surgery. 

Ophthalmologically effective concentrations or amounts of one or more 
of the conjugates and complexes are mixed with a suitable pharmaceutical carrier or 
vehicle. The concentrations or amounts of the conjugates and complexes that are 
effective require delivery of an amount, upon administration, that prevents or 
substantially reduces corneal clouding, trabeculectomy closure, or pterygii recurrence. 

The conjugates and complexes herein are formulated into 
ophthalmologically acceptable compositions and are applied to the affected area of the 
eye during or immediately after surgery. In particular, following excimer laser surgery, 
the composition is applied to the cornea; following trabeculectomy the composition is 
applied to the fistula; and following removal of pterygii the composition is applied to 
the cornea. The compositions may also be used to treat pterygii. The conjugates and 
complexes are applied during and immediately following surgery and may, if possible, 
be applied post-operatively, until healing is complete. The compositions are applied as 
drops for topical and subconjunctival application or are injected into the eye for 
intraocular application. The compositions may also be absorbed to a biocompatible 
[...] with the affected area. 

The ophthalmologic indications herein are typically treated locally 
[...] with a 
biocompatible sponge that has absorbed a solution of the conjugates and complexes or 
[...] applied during or immediately after surgery in order to prevent closure of the 
trabeculectomy, prevent a proliferation of keratocytes following excimer laser surgery, 
or to prevent a recurrence of pterygii. 
[...] to the eye, it can be slowly 
injected into the bulbar conjunctiva of the eye. 

[...] with photocleavable linkers are among those 
preferred for use in the methods herein. Upon administration of such composition to 
[...] 
or UV that cleaves the linker, thereby releasing the cytotoxic agent. 

If oral administration is desired, the conjugate should be provided in a 
composition that protects it from the acidic environment of the stomach. For example, 
the composition can be formulated in an enteric coating that maintains its integrity in 
[...] 
also be formulated in combination with an antacid or other such ingredient. 

Oral compositions will generally include an inert diluent or an edible 
carrier and may be compressed into tablets or enclosed in gelatin capsules. For the 
purpose of oral therapeutic administration, the active compound or compounds can be 
incorporated with excipients and used in the form of tablets, capsules or troches. 
Pharmaceutically compatible binding agents and adjuvant materials can be included as 
part of the composition. 

The tablets, pills, capsules, troches and the like can contain any of the
following ingredients, or compounds of a similar nature: a binder, such as
microcrystalline cellulose, gum tragacanth and gelatin; an excipient such as starch and
lactose; a disintegrating agent such as, but not limited to, alginic acid and corn starch; a
lubricant such as, but not limited to, magnesium stearate; a glidant, such as, but not
limited to, colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin;
and a flavoring agent such as peppermint, methyl salicylate, and fruit flavoring.
When the dosage unit form is a capsule, it can contain, in addition to
material of the above type, a liquid carrier such as a fatty oil. In addition, dosage unit
forms can contain various other materials which modify the physical form of the dosage
unit, for example, coatings of sugar and other enteric agents. The conjugates and
complexes can also be administered as a component of an elixir, suspension, syrup,
wafer, chewing gum or the like. A syrup may contain, in addition to the active
compounds, sucrose as a sweetening agent and certain preservatives, dyes and colorings
and flavors.

The active materials can also be mixed with other active materials that
do not impair the desired action, or with materials that supplement the desired action,
such as cis-platin for treatment of tumors.

Finally, the compounds may be packaged as articles of manufacture
containing packaging material, one or more conjugates and complexes or compositions
as provided herein within the packaging material, and a label that indicates the
indication for which the conjugate is provided.

Many methods have been developed to deliver nucleic acid into cells,
including retroviral vectors, electroporation, CaPO4 precipitation and microinjection,
but each of these methods has distinct disadvantages. Microinjecting nucleic acid into
cells is very time consuming because each cell must be manipulated individually.
Retroviral vectors can only hold a limited length of nucleic acid and can activate
oncogenes depending upon the insertion site in the target chromosome. Conditions for
electroporation and CaPO4-mediated transfection are harsh and cause much cell death.

By comparison, receptor mediated gene delivery as described herein is a
more desirable method of selectively targeting toxic genes into cells that have "more
active" receptors or that overexpress the specific receptor on the cell surface. A
receptor may be more active because it has a higher rate of internalization or higher
cycling rate through the endosome to the cell surface. Advantages of this method over
other gene delivery methods include increased specificity of delivery, the absence of
nucleic acid length limitations, reduced toxicity, and reduced immunogenicity of the
conjugate. These characteristics allow for repeated administration of the material with
minimal harm to cells and may allow increased level of expression of the toxic protein.
In addition, primary cultures can also be treated using this method.

The following examples are included for illustrative purposes only and 
are not intended to limit the scope of the invention. 



EXAMPLES

EXAMPLE 1 
Isolation of DNA Encoding Saporin 



A. Materials and methods
1. Bacterial Strains

E. coli strain JA221 (lpp- hdsM+ trpE5 leuB6 lacY recA1 F'[lacIq lac+
pro+]) is publicly available from the American Type Culture Collection (ATCC),
Rockville, MD 20852, under the accession number ATCC 33875. (JA221 is also
available from the Northern Regional Research Center (NRRL), Agricultural Research
Service, U.S. Department of Agriculture, Peoria, IL 61604, under the accession number
NRRL B-15211; see also U.S. Patent No. 4,757,013 to Inouye; and Nakamura et al.,
Cell 18:1109-1117, 1979). Strain INV1α is commercially available from Invitrogen,
San Diego, CA.

2. DNA Manipulations

The restriction and modification enzymes employed herein are
commercially available in the U.S. Native saporin and rabbit polyclonal antiserum to
saporin were obtained as previously described in Lappi et al., Biochem. Biophys. Res.
Comm. 129:934-942. Ricin A chain is commercially available from Sigma, Milwaukee,
WI. Antiserum was linked to Affi-gel 10 (Bio-Rad, Emeryville, CA) according to the
manufacturer's instructions. Sequencing was performed using the Sequenase kit of
United States Biochemical Corporation (version 2.0) according to the manufacturer's
instructions. Minipreparation and maxipreparation of plasmids, preparation of
competent cells, transformation, M13 manipulation, bacterial media, Western blotting,
and ELISA assays were according to Sambrook et al. (Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
1989). The purification of DNA fragments was done using the Geneclean II kit (Bio
101) according to the manufacturer's instructions. SDS gel electrophoresis was
performed on a Phastsystem (Pharmacia).

Western blotting was accomplished by transfer of the electrophoresed
protein to nitrocellulose using the PhastTransfer system, as described by the
manufacturer. The antiserum to SAP was used at a dilution of 1:1000. Horseradish
peroxidase labeled anti-IgG was used as the second antibody (see Davis et al., Basic
Methods In Molecular Biology, New York, Elsevier Science Publishing Co., pp 1-338,
1986).



B. Isolation of DNA encoding saporin 

1. Isolation of genomic DNA and preparation of polymerase chain reaction
(PCR) primers

Saponaria officinalis leaf genomic DNA was prepared as described in
Bianchi et al., Plant Mol. Biol. 11:203-214, 1988. Primers for genomic DNA
amplifications were synthesized in a 380B automatic DNA synthesizer. The primer
corresponding to the "sense" strand of saporin,
5'-CTGCAGAATTCGCATGGATCCTGCTTCAAT-3' (SEQ ID NO. 54), includes an
EcoRI restriction site adapter immediately upstream of the DNA codon for amino acid
-15 of the native saporin N-terminal leader sequence. The primer
5'-CTGCAGAATTCGCCTCGTTTGACTACTTTG-3' (SEQ ID NO. 55) corresponds to
the "antisense" strand of saporin and complements the coding sequence of saporin
starting from the last 5 nucleotides of the DNA encoding the carboxyl end of the mature
peptide. Use of this primer introduced a translation stop codon and an EcoRI restriction
site after the sequence encoding mature saporin.
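
As an illustrative check on the two primers above (SEQ ID NO. 54 and SEQ ID NO. 55), the following Python sketch locates the EcoRI recognition sequence (GAATTC) in each primer and shows the antisense primer as it reads on the coding strand. The function and constant names are hypothetical and are not part of the original disclosure.

```python
# Primer sequences copied from SEQ ID NO. 54 and SEQ ID NO. 55 in the text.
SENSE_PRIMER = "CTGCAGAATTCGCATGGATCCTGCTTCAAT"
ANTISENSE_PRIMER = "CTGCAGAATTCGCCTCGTTTGACTACTTTG"

ECORI_SITE = "GAATTC"  # EcoRI recognition sequence

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_site(seq: str, site: str) -> int:
    """Return the 0-based index of the first occurrence of site, or -1."""
    return seq.find(site)


if __name__ == "__main__":
    for name, primer in (("sense", SENSE_PRIMER), ("antisense", ANTISENSE_PRIMER)):
        print(name, "primer: EcoRI site at index", find_site(primer, ECORI_SITE))
    # Antisense primer as read on the coding strand: five coding nucleotides,
    # then a TAG stop codon, then the EcoRI site.
    print(reverse_complement(ANTISENSE_PRIMER))
```

Read on the coding strand, the antisense primer begins with five nucleotides followed directly by TAG and then GAATTC, which is consistent with the statement that it introduces a stop codon and an EcoRI site after the coding sequence.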

2. Amplification of DNA encoding saporin

Unfractionated Saponaria officinalis leaf genomic DNA (1 µl) was
mixed in a final volume of 100 µl containing 10 mM Tris-HCl (pH 8.3), 50 mM KCl,
0.01% gelatin, 2 mM MgCl2, 0.2 mM dNTPs, 0.8 µg of each primer. Next, 2.5 U Taq
DNA polymerase (Perkin Elmer Cetus) were added and the mixture was overlaid with
30 µl of mineral oil (Sigma). Incubations were done in a DNA Thermal Cycler
(Ericomp). One cycle included a denaturation step (94°C for 1 min), an annealing step
(60°C for 2 min), and an elongation step (72°C for 3 min). After 30 cycles, a 10 µl
aliquot of each reaction was run on a 1.5% agarose gel to verify the structure of the
amplified product.
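
The cycling program described above can be summarized in a short Python sketch that totals the programmed hold time. It assumes the annealing temperature reads 60°C and ignores ramp time between temperatures; the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float


# One cycle as described in the text: 94°C 1 min, 60°C 2 min, 72°C 3 min.
CYCLE = (
    Step("denaturation", 94, 1),
    Step("annealing", 60, 2),
    Step("elongation", 72, 3),
)
N_CYCLES = 30


def program_minutes(cycle, n_cycles: int) -> float:
    """Total programmed hold time in minutes, ignoring temperature ramps."""
    return n_cycles * sum(step.minutes for step in cycle)


if __name__ == "__main__":
    total = program_minutes(CYCLE, N_CYCLES)
    print(f"{N_CYCLES} cycles: {total:.0f} min ({total / 60:.1f} h) of hold time")
```

Actual run time on an instrument would be longer, since ramping between temperatures is not included.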

The amplified DNA was digested with EcoRI and subcloned into EcoRI-restricted
M13mp18 (New England Biolabs, Beverly, MA; see also Yanisch-Perron et
al. (1985), "Improved M13 phage cloning vectors and host strains: Nucleotide
sequences of the M13mp18 and pUC19 vectors", Gene 33:103). Single-stranded DNA
from recombinant phages was sequenced using oligonucleotides based on internal
points in the coding sequence of saporin (see Bennati et al., Eur. J. Biochem. 183:465-470,
1989). Nine of the M13mp18 derivatives were sequenced and compared. Of the
nine sequenced clones, five had unique sequences, set forth as SEQ ID NOs. 19-23,
respectively. The clones were designated M13mp18-G4, -G1, -G2, -G7, and -G9. Each
of these clones contains all of the saporin coding sequence and 45 nucleotides of DNA
encoding the native saporin N-terminal leader peptide.

[...] sequence was also cloned in the pET11a vector. Briefly,
the DNA encoding SAP-6 was amplified by polymerase chain reaction (PCR) from the
parental plasmid pZ1B1. The plasmid pZ1B1 contains the DNA sequence for human
FGF-2 linked to SAP-6 by a two-amino-acid linker (Ala-Met). pZ1B1 also includes the
T7 promoter, lac operator, ribosomal binding site, and T7 terminator present in the
pET-11a vector. For SAP-6 DNA amplification, the 5' primer (5'
CATATGTGTGTCACATCAATCACATTAGAT 3') (SEQ ID NO. 105),
corresponding to the sense strand of SAP-6, incorporated an NdeI restriction enzyme site
used for cloning. It also contained a Cys codon at position -1 relative to the start site of
the mature protein sequence. No leader sequence was included. The 3' primer (5'
CAGGTTTGGATCCTTTACGTT 3') (SEQ ID NO. 106), corresponding to the
antisense strand of SAP-6, had a BamHI site used for cloning. The amplified DNA was
gel-purified and digested with NdeI and BamHI. The digested SAP-6 DNA fragment
was subcloned into the NdeI/BamHI-digested pZ1B1. This digestion removed FGF-2
and the 5' portion of SAP-6 (up to nucleotide position 650) from the parental rFGF2-SAP
vector (pZ1B1) and replaced this portion with a SAP-6 molecule containing a Cys
at position -1 relative to the start site of the native mature SAP-6 protein. The resultant
plasmid was designated as pZ50B. pZ50B was transformed into E. coli strain
NovaBlue for restriction and sequencing analysis. The appropriate clone was then
transformed into E. coli strain BL21(DE3) for expression and large-scale production.

Mammalian codon optimization of saporin cDNA

Mammalian expression plasmids encoding β-galactosidase (β-gal), pSV-β
and pNASS-β, were obtained from Clontech (Palo Alto, CA). Plasmid pSVβ
expresses β-gal from the SV40 early promoter. Plasmid pNASSβ is a promoterless
mammalian reporter vector containing the β-gal gene.

The amino acid sequence for the plant protein saporin (SAP) was reverse
translated using mammalian codons. The resulting mammalian optimized cDNA was
divided into 4 fragments (designated 5'-3' A-D) for synthesis by PCR using overlapping
oligos. To facilitate subcloning of each fragment and piecing together of the entire
cDNA, restriction enzyme sites were added to the ends of each fragment, and added or
removed within each fragment without changing the corresponding amino acid
sequence. In addition, the 5' end of the cDNA was modified to include a Kozak
sequence for optimal expression in mammalian cells. Fragments A, B, and D were each
synthesized by annealing 4 oligos (2 sense, 2 antisense) with 20 base overlaps and using
PCR to fill-in and amplify the fragments. The PCR products were then purified using
GeneClean (Bio101), digested with restriction enzymes recognizing the sites in the
primers, and subcloned into pBluescript (SK+) (Stratagene). The sequence of the inserts
was verified using Sequenase Version 2.0 (United States Biochemical/Amersham).
Fragment C was synthesized in two steps: the 5' and 3' halves of the fragment were
independently synthesized by PCR using 2 overlapping oligos. The products of these
2 reactions were then purified and combined and the full-length fragment C was
generated by PCR using the outermost oligos as primers. Full-length fragment C was
subcloned into pBluescript for sequencing. Fragments A and B were ligated together in
pBluescript at an overlapping KspI site. Fragments C and D were ligated together in
pBluescript at an overlapping PvuII site. Fragments A-B and C-D were then joined in
pBluescript at an overlapping AvaI site to give the full-length mammalian optimized
SAP cDNA. β-gal sequences were excised from the plasmids pNASS-β and pSV-β
(Clontech) by digestion with NotI and replaced with the synthetic SAP gene, which has
NotI ends. Orientation of the insert was confirmed by restriction enzyme digestion.
Large scale plasmid preparations were performed using Qiagen Maxi 500 columns.
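
The 20-base overlap design used for the fill-in PCR can be checked computationally. The sketch below is illustrative only: it takes oligos A2 (antisense) and A3 (sense) as they are printed in the oligo list of this section, reverse-complements A2, and measures how many bases at its 3' end match the 5' end of A3. Function names are hypothetical.

```python
# Oligos A2 and A3 as printed in this section (SEQ ID NO. 90 and 91).
A2_ANTISENSE = (
    "TTGGGGTCCTTCACGTTGTTGCGGATCTTGTCCACGAAGGAGG"
    "AGTACTGGCCGGCGGTGGGGTTCACC"
)
A3_SENSE = (
    "AACAACGTGAAGGACCCCAACCTGAAGTACGGCGGCACCGACAT"
    "CGCCGTGATCGGCCCCCCCTC"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def longest_overlap(left: str, right: str) -> int:
    """Length of the longest suffix of left that equals a prefix of right."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return k
    return 0


if __name__ == "__main__":
    a2_on_sense_strand = reverse_complement(A2_ANTISENSE)
    k = longest_overlap(a2_on_sense_strand, A3_SENSE)
    print("A2/A3 overlap:", k, "bases:", A3_SENSE[:k])
```

For this pair the overlap is 20 bases, in agreement with the "20 base overlaps" stated in the text.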
The oligos used to synthesize each SAP fragment are (5'-3'): 

Al(sense):CGTATGAGGGGGGGGGGGGGATGGTGAGGTGCATCAGGGTGGACC 
TGGTGAACCCCACCGCCGGCC (SEQ ID NO.: 89) 

A2(antisense):TTGGGGTCCTTCACGTTGTTGCGGATCTTGTCCACGAAGGAGG
AGTACTGGCCGGCGGTGGGGTTCACC (SEQ ID NO.: 90) 

A3(sense):AACAACGTGAAGGACCCCAACCTGAAGTACGGCGGCACCGACAT 
CGCCGTGATCGGCCCCCCCTC (SEQ ID NO.: 91) 


A4(antisense):GTGCCGCGGGAGGACTGGAAGTTGATGCGCAGGAACTTCTCCT 
TGGAGGGGGGGCCGATCACGGC (SEQ ID NO.: 92) 

B1(sense):CTCCCGCGGCACCGTGTCCCTGGGCCTGAAGCGCGACAACCTGTA
CGTGGTGGCCTACCTGGCCATGGACAACAC (SEQ ID NO.: 93)
B2(antisense):GCGGTCAGCTCGGCGGAGGTGATCTCGGACTTGAAGTAGTAGG
CGCGGTTCACGTTGGTGTTGTCCATGGCCAGGTA (SEQ ID NO.: 94) 

B3(sense):GCCGAGCTGACCGCCCTGTTCCCTGAGGCCACCACCGCCAACCAG
AAGGCCCTGGAGTACACCGAGGACTACCAGTCC (SEQ ID NO.: 95) 

B4(antisense):AGCCCGAGCTCCTTGCGGGACTTGTCGCCCTGGGTGATCTGGG 
CGTTCTTCTCGATGGACTGGTAGTCCTCGGTGT (SEQ ID NO.: 96) 


Cl(sense):TATAGAATTCCTCGGGCTGGGCATCGACCTGCTGCTGACCTTCATG 
GAGGCCGTGAACAAGAAGGCCCGCGTGG (SEQ ID NO.: 97) 

C2(antisense):CGGCGGTCATCTGGATGGCGATCAGCAGGAAGCGGGCCTCGTT 
CTTCACCACGCGGGCCTTCTTGTTC (SEQ ID NO.: 98)

C3(sense):CGCCATCCAGATGACCGCCGAGGTGGCCCGCTTCCGCTACATCCA 
GAACCTGGTGACCAAGAACTTCCCC (SEQ ID NO.: 99) 

C4(antisense):GGCGGATCCCAGCTGACCTCGAACTGGATCACCTTGTTGTCGG
AGTCGAACTTGTTGGGGAAGTTCTTGGTCACCA (SEQ ID NO.: 100) 

D1(sense):CCGGGATCCGTCAGCTGGCGCAAGATCTCCACCGCCATCTACGGC
GACGCCAAGAACGGCG (SEQ ID NO.: 101) 

D2(antisense):GCACCTTGCCGAAGCCGAAGTCGTAGTCCTTGTTGAACACGCC 
GTTCTTGGCGTCGCCGTAGAT (SEQ ID NO.: 102) 

D3(sense):TTCGGCTTCGGCAAGGTGCGCCAGGTGAAGGACCTGCAGATGGGC 
CTGCTGATGTACC (SEQ ID NO.: 103)
D4(antisense):TGAACGTGGCGGCCGCCTACTTGGGCTTGCCCAGGTACATCAG
CAGGCCCAT (SEQ ID NO.: 104)
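
The reverse translation step described above can be illustrated with oligo A3 from the list. In this hedged sketch, the codon table is partial and was read off oligo A3 itself (one codon per residue that occurs there); it is not the table the authors used, and the peptide string is simply the translation of the first 21 codons of A3. All names are hypothetical.

```python
# Partial codon table: one codon per amino acid, as observed in oligo A3.
# This is an illustrative assumption, not the authors' full codon table.
PREFERRED_CODON = {
    "A": "GCC", "D": "GAC", "G": "GGC", "I": "ATC", "K": "AAG",
    "L": "CTG", "N": "AAC", "P": "CCC", "T": "ACC", "V": "GTG", "Y": "TAC",
}

# Oligo A3 (sense) as printed in the list above (SEQ ID NO. 91).
A3_SENSE = (
    "AACAACGTGAAGGACCCCAACCTGAAGTACGGCGGCACCGACAT"
    "CGCCGTGATCGGCCCCCCCTC"
)

# Translation of the first 21 codons of A3.
PEPTIDE = "NNVKDPNLKYGGTDIAVIGPP"


def reverse_translate(peptide: str) -> str:
    """Map each residue to its single preferred codon (KeyError if unknown)."""
    return "".join(PREFERRED_CODON[aa] for aa in peptide)


if __name__ == "__main__":
    dna = reverse_translate(PEPTIDE)
    print(dna)
    print("matches start of A3:", dna == A3_SENSE[: len(dna)])
```

A one-codon-per-residue table is the simplest form of codon optimization; the source also describes adding or removing restriction sites without changing the amino acid sequence, which this sketch does not attempt.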


C. pOMPAG4 Plasmid Construction

M13 mp18-G4 was digested with EcoRI, and the resulting fragment was
[...]
No. 4,575,013 to Inouye; and Duffaud et al., Meth. Enz. 153:492-507, 1987 [...]
saporin [...]
ID NO. 19. The plasmid also includes the E. coli lac repressor gene (lac I).

The M13 mp18-G1, -G2, -G7, and -G9 clones, containing SEQ ID NOs.
20-23, respectively, are digested with EcoRI and ligated into EcoRI digested
pIN-IIIompA2 as described for M13 mp18-G4 above in this example. The resulting
plasmids, labeled pOMPAG1, pOMPAG2, pOMPAG7, pOMPAG9, [...]
expressed, purified, and characterized as described for the plasmid pOMPAG4.

INV1α competent cells were transformed with pOMPAG4 and cultures
containing the desired plasmid structure were grown further in order to obtain a large
preparation of isolated pOMPAG4 plasmid using methods described herein.

D. Saporin expression in E. coli

[...] which the expression of the saporin-containing protein is repressed by the lac
repressor until the end of the log phase of growth, at which time IPTG was added to
induce expression of the saporin-encoding DNA.

To generate a large-batch culture of pOMPAG4 transformed E. coli
cells, an overnight culture (approximately 16 hours growth) of JA221 [...] cells
transformed with the plasmid [...] (see e.g., Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989) containing 125 mg/ml ampicillin was [...]
flask containing 750 ml LB [...] cells were grown at [...]
logarithmic phase with shaking at [...]
measured in a spectrophotometer.

In the second step, saporin expression [...] Induced cultures were grown for 2
additional hours [...] EDTA [...]
pellet resuspended in [...]
corresponding to the pellet.

The E. coli cells containing C-SAP construct p[...]
[...] was injected into a [...] Applikon (Foster City, CA)
fermentor containing 2 l complex [...]
each of yeast extract and tryptone [...]
KH2PO4, 1.66 g/l of (NH4)2SO4 [...] of a trace metal solution
[...] g/l of CoCl2·6H2O, 2.0 g/l of [...]
Na2MoO4·2H2O [...]
ZnCl2·4H2O [...] of a vitamin solution
[...] of thiamine·HCl [...]
pyridoxine·HCl [...]
[...] of complex [...] phosphates [...]
continuous addition of a [...]
and containing only 25 nM, each, of trace metal and vitamin solutions. The feed
addition continued until the optical density of the culture reached 85, at which time
(approximately 9 h) the culture was induced with 0.1 mM isopropyl
β-D-thiogalactopyranoside. During
4 h of post-induction incubation, the culture was fed with a solution containing 100 g/l
of glucose, 100 g/l of yeast extract, and 200 g/l of tryptone. Finally, the cells were
harvested by centrifugation (8000xg, 10 min) and frozen at -80°C until further
processed.

The cell pellet (≈400 g wet mass) containing C-SAP was resuspended in
3 vol Buffer B (10 mM sodium phosphate pH 7.0, 5 mM EDTA, 5 mM EGTA, and 1
mM dithiothreitol). The suspension was passed through a microfluidizer three times at
124 MPa on ice. The resultant lysate was diluted with NanoPure H2O until conductivity
fell below 2.7 mS/cm. All subsequent procedures were performed at room temperature.

The diluted lysate was loaded onto an expanded bed of Streamline SP
cation-exchange resin (300 ml) equilibrated with buffer C (20 mM sodium phosphate
pH 7.0, 1 mM EDTA) at 100 ml/min upwards flow. The resin was washed with buffer
C until it appeared clear. The plunger was then lowered at 2 cm/min while washing
continued at 70 ml/min. Upwards flow was stopped when the plunger was
approximately 8 cm away from the bed and the plunger was allowed to move to within
0.5 cm of the packed bed. The resin was further washed at 70 ml/min downwards flow
until A280 reached baseline. Buffer C plus 0.25 M NaCl was then used to elute proteins
containing C-SAP at the same flow rate.

The eluate was buffer exchanged into buffer D (50 mM sodium borate
pH 8.5, 1 mM EDTA) using the Sartocon Mini crossflow filtration system with a 10,000
NMWCO module (Sartorius). The sample was then applied to a column of
Source 15S (30 ml) equilibrated with buffer D. A 10-column-volume linear gradient of
0-0.3 M NaCl in buffer D was used to elute C-SAP at 30 ml/min.
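
The gradient parameters in the preceding paragraph translate directly into an elution volume, a run time, and a salt concentration at any point in the gradient. The following Python sketch works through that arithmetic; it assumes an ideal linear gradient with no delay volume, and the function names are hypothetical.

```python
import math

COLUMN_VOLUME_ML = 30.0    # Source 15S column volume from the text
GRADIENT_CV = 10           # gradient length in column volumes
START_M, END_M = 0.0, 0.3  # NaCl at start and end of the gradient (M)
FLOW_ML_PER_MIN = 30.0     # flow rate from the text


def gradient_volume_ml(column_volume_ml: float, n_cv: float) -> float:
    """Total gradient volume in ml."""
    return column_volume_ml * n_cv


def nacl_molar_at(volume_ml: float, total_ml: float,
                  start_m: float = START_M, end_m: float = END_M) -> float:
    """NaCl concentration after volume_ml of an ideal linear gradient."""
    fraction = min(max(volume_ml / total_ml, 0.0), 1.0)
    return start_m + (end_m - start_m) * fraction


if __name__ == "__main__":
    total = gradient_volume_ml(COLUMN_VOLUME_ML, GRADIENT_CV)
    print(f"gradient volume: {total:.0f} ml")
    print(f"run time: {total / FLOW_ML_PER_MIN:.0f} min")
    print(f"NaCl at mid-gradient: {nacl_molar_at(total / 2, total):.2f} M")
    assert math.isclose(nacl_molar_at(total, total), END_M)
```

Under these assumptions the gradient is 300 ml delivered over 10 minutes, a slope of 1 mM NaCl per ml.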

F. Assay for cytotoxic activity

The ribosome inactivating protein activity of recombinant saporin was
compared to the ribosome inactivating protein activity of native SAP in an in vitro
assay measuring cell-free protein synthesis in a nuclease-treated rabbit reticulocyte
lysate (Promega). Samples of immunoaffinity-purified saporin were diluted in PBS and
5 µl of sample was added on ice to 35 µl of rabbit reticulocyte lysate and 10 µl of a
reaction mixture containing 0.5 µl of Brome Mosaic Virus RNA, 1 mM amino acid
mixture minus leucine, 5 µCi of tritiated leucine and 3 µl of water. Assay tubes were
incubated 1 hour in a 30°C water bath. The reaction was stopped by transferring the
tubes to ice and adding 5 µl of the assay mixture, in triplicate, to 75 µl of 1 N sodium
hydroxide, 2.5% hydrogen peroxide in the wells of a Millititer HA 96-well filtration
plate (Millipore). When the red color had bleached from the samples, 300 µl of ice cold
25% trichloroacetic acid (TCA) were added to each well and the plate left on ice for
another 30 min. Vacuum filtration was performed with a Millipore vacuum holder.
The wells were washed three times with 300 µl of ice cold 8% TCA. After drying, the
filter paper circles were punched out of the 96-well plate and counted by liquid
scintillation techniques.

The IC50 for the recombinant and native saporin were approximately 20
pM. Therefore, recombinant saporin-containing protein has full protein synthesis
inhibition activity when compared to native saporin.
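
One common way to estimate an IC50 from a dilution series such as the one assayed above is to interpolate on a log-concentration scale between the two points that bracket 50% of control incorporation. The sketch below shows that calculation. The interpolation method is an assumption (the source does not say how the IC50 was derived), and the data points are hypothetical values chosen only to exercise the function; they are not measurements from this document.

```python
import math


def ic50_log_interpolate(concs_pm, percent_of_control):
    """Estimate the concentration giving 50% of control incorporation.

    concs_pm must be in ascending order; percent_of_control holds the
    matching responses. Interpolation is linear in log10(concentration).
    """
    points = list(zip(concs_pm, percent_of_control))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo >= 50.0 >= y_hi and y_lo != y_hi:
            frac = (y_lo - 50.0) / (y_lo - y_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    raise ValueError("50% point is not bracketed by the data")


# Hypothetical dilution series (pM) and responses (% of untreated control).
HYPOTHETICAL_CONCS_PM = [1.0, 10.0, 100.0, 1000.0]
HYPOTHETICAL_PERCENT = [95.0, 70.0, 30.0, 5.0]

if __name__ == "__main__":
    ic50 = ic50_log_interpolate(HYPOTHETICAL_CONCS_PM, HYPOTHETICAL_PERCENT)
    print(f"estimated IC50: {ic50:.1f} pM")
```

Comparing the values obtained this way for recombinant and native protein, run side by side, is what supports a statement that the two have equivalent activity.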



EXAMPLE 2 

Preparation of FGF Muteins

A. Materials and Methods
1. Reagents

Restriction and modification enzymes were purchased from BRL
(Gaithersburg, MD), Stratagene (La Jolla, CA) and New England Biolabs (Beverly,
MA).

Plasmid pFC80, containing the basic FGF coding sequence, was a gift of
Drs. Paolo Sarmientos and Antonella Isacchi of Farmitalia Carlo Erba (Milan, Italy).
Plasmid pFC80 has been described in PCT Application Serial No. WO 90/02800
and PCT Application Serial No. PCT/US93/05702, which are herein incorporated in
[...] forth in PCT Application Serial No. PCT/US93/05702 and in SEQ ID NO. 52.

M13 manipulations were carried out according to published procedures (Sambrook et
al., [...] Cold Spring Harbor, NY, 1989) [...]
the Geneclean II kit purchased from Bio 101 (La Jolla, CA). Sequencing of the
[...] constructs was performed using Sequenase [...]

SDS gel electrophoresis was performed on a PhastSystem utilizing [...]
(Pharmacia). Western blotting was accomplished by transfer of electrophoresed
protein to nitrocellulose using the PhastTransfer system [...]
[...] dilution of 1:1000
(Davis, L., Dibner et al. (1986) Basic Methods in Molecular Biology, p. 1, Elsevier
Science Publishing Co., New York).

B. Preparation of [...] FGF by site-directed mutagenesis

[...] oligonucleotide-directed
mutagenesis. Oligonucleotides encoding the new amino acid were synthesized using a 380B
automatic DNA synthesizer (Applied Biosystems, Foster City, CA).

1. Mutagenesis

The oligonucleotide used for in vitro mutagenesis of cysteine 78
[...] SEQ ID NO. [...] replicative form [...]
E. coli strain JM109 and single plaques were picked and sequenced for verification of
the mutation. The FGF mutated gene was then cut out of M13, ligated into the
expression vector pFC80, which had the non-mutated form of the gene removed, and
transformed into E. coli strain JM109. Single colonies were picked and the plasmids
sequenced to verify the mutation was present. Plasmids with correct mutation were
then transformed into the E. coli strain FICE 2 and single colonies from these
transformations were used to obtain the mutant basic FGFs. Approximately 20 mg
protein per liter of fermentation broth was obtained.

2. Purification of mutagenized FGF

Cells were grown overnight in 20 ml of LB broth containing 100 µg/ml
ampicillin. The next morning the cells were pelleted and transferred to 500 ml of M9
medium with 100 µg/ml ampicillin and grown for 7 hours. The cells were pelleted and
resuspended in lysis solution (10 mM TRIS, pH 7.4, 150 mM NaCl, lysozyme, 10
µg/mL, aprotinin, 10 µg/mL, leupeptin, 10 µg/mL, pepstatin A, 10 µg/mL and 1 mM
PMSF; 45-60 ml per 16 g of pellet) and incubated while stirring for 1 hour at room
temperature. The solution was frozen and thawed three times and sonicated for 2.5
minutes. The suspension was centrifuged; the supernatant saved and the pellet
resuspended in another volume of lysis solution without lysozyme, centrifuged again
and the supernatants pooled. Extract volumes (40 ml) were diluted to 50 ml with
10 mM TRIS, pH 7.4 (buffer A). Pools were loaded onto a 5 ml Hi-Trap
heparin-Sepharose column (Pharmacia, Uppsala, Sweden) equilibrated in 150 mM sodium
chloride in buffer A. The column was washed with 0.6 M sodium chloride and 1 M
sodium chloride in buffer A and then eluted with 2 M sodium chloride in buffer A.
Peak fractions of the 2 M elution, as determined by optical density at 280 nm, were
pooled and purity determined by gel electrophoresis. Yields were 10.5 mg of purified
protein for the Cys78 mutant and 10.9 mg for the Cys96 mutant.

The biological activity of [C78S]FGF and [C96S]FGF was measured on
adrenal capillary endothelial cells in culture. Cells were plated at 3,000 per well in a 24
well plate in 1 ml of 10% calf serum-HDMEM. Cells were allowed to attach, and
samples were added in triplicate at the indicated concentration and incubated for 48 h at
37°C. An equal quantity of samples was added and further incubated for 48 h. Medium
was aspirated; cells were treated with trypsin (1 ml volume) to remove cells to 9 ml of
Hematall diluent and counted in a Coulter Counter. The results show that the two
mutants retain virtually complete proliferative activity of native basic FGF as
judged by the ability to stimulate endothelial cell proliferation in culture.



EXAMPLE 3
Preparation of Mono-Derivatized Nucleic Acid 
Binding Domain (MyoD) 

MyoD at a concentration of 4.1 mg/ml is dialyzed against 0.1 M sodium
phosphate, 0.1 M sodium chloride, pH 7.5. A 1.1 molar excess (563 µg in 156
µl of anhydrous ethanol) of SPDP (Pharmacia, Uppsala, Sweden) is added and the
reaction mixture immediately agitated and put on a rocker platform for 30 minutes. The
solution is then dialyzed against the same buffer. An aliquot of the dialyzed solution is
examined for extent of derivatization according to the Pharmacia instruction sheet. The
extent of derivatization is typically 0.79 to 0.86 moles of SPDP per mole of nucleic acid
binding domain.

Derivatized myoD (32.3 mg) is dialyzed in 0.1 M sodium borate, pH 9.0,
and applied to a Mono S 16/10 column equilibrated with 25 mM sodium chloride in
dialysis buffer. A gradient of 25 mM to 125 mM sodium chloride in dialysis buffer
elutes free and derivatized nucleic acid binding domain. The flow rate is 4.0 ml/min;
4 ml fractions are collected. Aliquots of fractions are assayed for protein
concentration (BCA Protein Assay, Pierce Chemical, Chicago, IL) and for pyridylthione
released by reducing agent. Individual fractions (25 to 37) are analyzed for protein
concentration and pyridyl-disulfide concentration. The data indicate a separation
according to the level of derivatization by SPDP. The initial eluting peak is composed
of myoD that is approximately di-derivatized; the second peak is mono-derivatized and
the third peak shows no derivatization. The di-derivatized material accounts for
approximately 20% of the three peaks; the second accounts for approximately 48% and
the third peak contains approximately 32%. Material from the second peak is pooled
and gives an average ratio of pyridyl-disulfide to myoD of 0.95. Fraction 33, which
showed a divergent ratio of pyridine-2-thione to protein, was excluded from the pool.
Fractions that showed a ratio of SPDP to myoD greater than 0.85 but less than 1.05 are
pooled, dialyzed against 0.1 M sodium chloride, 0.1 M sodium phosphate, pH 7.5 and
used for derivatization with basic FGF.
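
The pooling rule stated above (ratio of SPDP to myoD greater than 0.85 but less than 1.05) is a simple filter over per-fraction ratios. The sketch below applies it to a set of hypothetical ratios; the numbers are invented for illustration and are not data from this document, and the function name is an assumption.

```python
def fractions_to_pool(ratios, low=0.85, high=1.05):
    """Return fraction numbers whose SPDP:protein ratio lies strictly
    between low and high, in ascending fraction order."""
    return [number for number, ratio in sorted(ratios.items()) if low < ratio < high]


# Hypothetical SPDP:myoD molar ratios keyed by fraction number.
HYPOTHETICAL_RATIOS = {30: 0.91, 31: 0.97, 32: 1.02, 33: 1.31, 34: 0.88, 35: 0.62}

if __name__ == "__main__":
    pooled = fractions_to_pool(HYPOTHETICAL_RATIOS)
    print("pooled fractions:", pooled)
    mean_ratio = sum(HYPOTHETICAL_RATIOS[n] for n in pooled) / len(pooled)
    print(f"mean ratio of pool: {mean_ratio:.2f}")
```

A fraction with a divergent ratio, like the hypothetical fraction 33 here, falls outside the window and is left out of the pool.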



EXAMPLE 4

Preparation of Modified Nucleic Acid Binding Domain (MyoD) 

As an alternative to derivatization, myoD is modified by addition of a
cysteine residue at or near the N-terminus-encoding portion of the DNA. The resulting
myoD can then react with an available cysteine on an FGF or react with a linker or a
linker attached to an FGF to produce conjugates that are linked via the added Cys.

Modified myoD is prepared by modifying DNA encoding the myoD
(GenBank Accession No. X56677). DNA encoding Cys is inserted at position -1 or at a
codon within 10 or fewer residues of the N-terminus. The resulting DNA is inserted
into pET11a and pET15b and expressed in BL21 cells (NOVAGEN, Madison, WI).

A. Preparation of myoD with an added cysteine residue at the N-terminus

Primer #1, corresponding to the sense strand of myoD, nucleotides 121-144,
incorporates an NdeI site and adds a Cys codon 5' to the start site for the mature
protein:

5'-CATATGTGTGAGCTACTGTCGCCACCGCTC-3' (SEQ ID NO. 58)

Primer #2 is an antisense primer complementing the coding sequence of
nucleic acid binding domain spanning nucleotides 1054-1077 and contains a BamHI
site.
5'-GGATCCGAGCACCTGGTATATCGGTGGGGG-3' (SEQ ID NO. 59)
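
The design of Primer #1 can be verified by locating the NdeI site (CATATG) and translating from the ATG it contains. This Python sketch uses the standard genetic code and the primer sequence as given in SEQ ID NO. 58; the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

PRIMER_1 = "CATATGTGTGAGCTACTGTCGCCACCGCTC"  # SEQ ID NO. 58
NDEI_SITE = "CATATG"                          # NdeI recognition sequence


def translate(seq: str) -> str:
    """Translate complete codons of seq in frame 0 ('*' marks a stop)."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


if __name__ == "__main__":
    site_index = PRIMER_1.find(NDEI_SITE)
    start = site_index + 3  # the ATG inside CATATG
    print("NdeI site at index", site_index)
    print("translation from ATG:", translate(PRIMER_1[start:]))
```

The translation begins Met-Cys, showing the added Cys codon (TGT) placed immediately after the initiator ATG supplied by the NdeI site.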



MyoD is amplified by PCR as follows using the above primers. A
clone containing [...]
MgCl2, 0.2 mM dNTPs, 0.8 µg of each primer. Next, 2.5 U Taq DNA
polymerase (Boehringer Mannheim) is added and the mixture is overlaid with 30 µl of
mineral oil (Sigma). Incubations are done in a DNA Thermal Cycler. Cycles include a
denaturation step (94°C for 1 min), an annealing step (60°C for 2 min), and an
elongation step (72°C for 3 min). After 35 cycles, a 10 µl aliquot of each reaction is run
on a 1.5% agarose gel to verify the correct structure of the amplified product.

The amplified DNA is gel purified and digested with NdeI and BamHI
and subcloned into NdeI and BamHI-digested plasmid containing FGF/myoD. This
[...] -SAP [...]
with DNA encoding a myoD molecule that contains a cysteine residue at position -1
relative to the start site of the native mature SAP protein.



B. [...]
4 or 10 of the native protein by replacing the Ser residue at position 4 or the Val residue
at position 10 with cysteine.

MyoD is amplified by polymerase chain reaction (PCR) from the
parental plasmid encoding the FGF-nucleic acid binding domain fusion protein using
primers that incorporate a TGT or TGC codon at position 4 or 10.

[...] described above, using the
following cycles: denaturation step 94°C for 1 minute, annealing for 2 minutes at [...]
and extension for 2 minutes at 72°C for 35 cycles. The amplified DNA is gel purified,
digested with NdeI and BamHI, and subcloned into NdeI and BamHI digested pET11a.
This digestion removes the FGF and 5' portion of nucleic acid binding domain (up to
the newly added BamHI) from the parental FGF-myoD vector and replaces this portion
with a myoD molecule containing a Cys at position 4 or 10 relative to the start site of
the native protein.

The resulting plasmid is digested with NdeI/BamHI and inserted into
pET15b (NOVAGEN, Madison, WI), which has a His-Tag™ leader sequence (SEQ ID
NO. 60), that has also been digested with NdeI/BamHI.

DNA encoding unmodified myoD can be similarly inserted into a pET5b
or pET11a and expressed as described below for the modified SAP-encoding DNA.

C. Expression of the modified nucleic acid binding domain-encoding DNA

BL21(DE3) cells are transformed with the resulting plasmids and
cultured as described in Example 2, except that all incubations are conducted at 30°C
instead of 37°C. Briefly, a single colony is grown in LB AMP100 to an OD600 of 1.0-1.5
and then induced with IPTG (final concentration 0.1 mM) for 2 h. The bacteria are
spun down.

D. Purification of modified nucleic acid binding domain 

Lysis buffer (20 mM NaPO4, pH 7.0, 5 mM EDTA, 5 mM EGTA, 1 mM
DTT, 0.5 µg/ml leupeptin, 1 µg/ml aprotinin, 0.7 µg/ml pepstatin) is added to the
myoD cell paste (produced from pZ50B1 in BL21 cells, as described above) in a ratio
of 1.5 ml buffer/g cells. This mixture is evenly suspended via a Polytron homogenizer
and passed through a microfluidizer twice.

The resulting lysate is centrifuged at 50,000 rpm for 45 min. The
supernatant is diluted with SP Buffer A (20 mM NaPO4, 1 mM EDTA, pH 7.0) so that
the conductivity is below 2.5 mS/cm. The diluted lysate supernatant is then loaded onto
an SP-Sepharose column, and a linear gradient of 0 to 30% SP Buffer B (1 M NaCl,
20 mM NaPO4, 1 mM EDTA, pH 7.0) in SP Buffer A with a total of 6 column volumes
is applied. Fractions containing myoD are combined and the resulting nucleic acid
binding domain has a purity of greater than 90%. A buffer exchange step is used to get
the SP eluate into a buffer containing 50 mM NaBO3, 1 mM EDTA, pH 8.5 (S Buffer
A). This sample is then applied to a Resource S column (Pharmacia, Sweden)
pre-equilibrated with S Buffer A. Pure nucleic acid binding domain is eluted off the
column by 10 column volumes of a linear gradient of 0 to 300 mM NaCl in SP Buffer
A.
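
The first gradient above is specified as a percentage of SP Buffer B rather than as a salt concentration. The Python sketch below converts between the two, assuming ideal mixing and that SP Buffer A contains no NaCl (as its stated composition implies); function names are hypothetical.

```python
import math


def nacl_mm(percent_b: float, nacl_in_a_mm: float = 0.0,
            nacl_in_b_mm: float = 1000.0) -> float:
    """NaCl concentration (mM) of a mix containing percent_b % Buffer B."""
    fraction_b = percent_b / 100.0
    return (1.0 - fraction_b) * nacl_in_a_mm + fraction_b * nacl_in_b_mm


def slope_mm_per_cv(start_percent_b: float, end_percent_b: float,
                    column_volumes: float) -> float:
    """Change in NaCl concentration (mM) per column volume of gradient."""
    return (nacl_mm(end_percent_b) - nacl_mm(start_percent_b)) / column_volumes


if __name__ == "__main__":
    print(f"30% SP Buffer B = {nacl_mm(30):.0f} mM NaCl")
    print(f"slope over 6 CV: {slope_mm_per_cv(0, 30, 6):.0f} mM per CV")
    assert math.isclose(nacl_mm(100), 1000.0)
```

On these assumptions the SP-Sepharose gradient ends at 300 mM NaCl, the same endpoint as the Resource S gradient, but it is reached over 6 column volumes instead of 10.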

In this preparation, ultracentrifugation is used to clarify the lysate; other
methods, such as filtration and using flocculents, also can be used. In addition,
Streamline S (PHARMACIA, Sweden) may also be used for large scale preparations.



EXAMPLE 5 

Preparation of Conjugates Containing FGF Muteins

Coupling of FGF muteins to nucleic acid binding domain

1. Chemical synthesis of [C78S]FGF-nucleic acid binding domain [...]

[C78S]FGF or [C96S]FGF (1 mg; 56 nmol) that had been dialyzed
against phosphate-buffered saline is added to 2.5 mg mono-derivatized nucleic acid
binding domain (a 1.5 molar excess over the basic FGF mutants) and left on a rocker
platform overnight. The next morning the ultraviolet-visible wavelength spectrum is
taken to determine the extent of reaction by the release of pyridylthione, which absorbs
at 343 nm with a known extinction coefficient. The ratio of pyridylthione to basic FGF
mutant for [C78S]FGF is 1.05 and for [C96S]FGF is 0.92. The reaction mixtures are
treated identically for purification in the following manner: reaction mixture is passed
over a HiTrap heparin-Sepharose column (1 ml) equilibrated with 0.15 M sodium
chloride in buffer A at a flow rate of 0.5 ml/min. The column is washed with 0.6 M
NaCl and 1.0 M NaCl in buffer A and the product eluted with 2.0 M NaCl in buffer A.
Fractions (0.5 ml) are analyzed by gel electrophoresis and absorbance at 280 nm. Peak
tubes are pooled and dialyzed versus 10 mM sodium phosphate, pH 7.5 and applied to a
Mono-S 5/5 column equilibrated with the same buffer. A 10 ml gradient between 0 and
1.0 M sodium chloride in equilibration buffer is used to elute the product. Purity is
determined by gel electrophoresis and peak fractions are pooled.
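
The extent of reaction is read from the absorbance of released pyridylthione at 343 nm. The calculation can be sketched as below. The extinction coefficient (8080 M^-1 cm^-1, a commonly cited literature value for pyridine-2-thione), the 1 cm path length, the reaction volume and the absorbance reading are all assumptions made for illustration; none of them is stated in this Example.

```python
# Assumed literature value for pyridine-2-thione at 343 nm (not given in the text).
PYRIDINE_2_THIONE_EPSILON = 8080.0  # M^-1 cm^-1


def thione_per_fgf(a343, volume_ml, fgf_nmol, path_cm=1.0):
    """Moles of released pyridylthione per mole of FGF mutein (Beer-Lambert)."""
    molar = a343 / (PYRIDINE_2_THIONE_EPSILON * path_cm)  # mol/L
    thione_nmol = molar * (volume_ml / 1000.0) * 1e9      # mol/L * L -> nmol
    return thione_nmol / fgf_nmol


# Hypothetical reading: A343 = 0.475 in a 1 ml reaction containing 56 nmol mutein.
ratio = thione_per_fgf(a343=0.475, volume_ml=1.0, fgf_nmol=56.0)
```

With these hypothetical inputs the ratio comes out close to 1, which is the range reported above for the two muteins.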
Under these conditions, virtually 100% of the mutant FGFs react with
mono-derivatized myoD. Because the free surface cysteine of each mutant acts as a free
sulfhydryl, it is unnecessary to reduce cysteines after purification from the bacteria.
The resulting product is purified by heparin-Sepharose (data not shown), thus
establishing that heparin binding activity of the conjugate is retained.

2. Expression of the recombinant FGF[C78/96S]-nucleic acid binding
domain fusion proteins (FPFN4)

A two-stage method is used to produce recombinant FGF[C78/96S]-myoD
protein (hereinafter FPFN4). Two hundred and fifty ml of LB medium
containing ampicillin (100 µg/ml) are inoculated with a fresh glycerol stock of bacteria
containing the plasmid. Cells are grown at 30°C in an incubator shaker to an OD600 of
0.7 and stored overnight at 4°C. The following day, cells are pelleted and resuspended
in fresh LB medium (no ampicillin). The cells are divided into five 1-liter batches and
grown at 30°C in an incubator shaker to an OD600 of 1.5. IPTG is added to a final
concentration of 0.1 mM and growth is continued for about 2 to 2.5 hours, at which time
cells are harvested by centrifugation.

EXAMPLE 6

Recombinant Production of FGF-Nucleic Acid 
Binding Domain Fusion Protein 

A. General Descriptions 

1. Bacterial Strains and Plasmids

E. coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS were purchased from NOVAGEN, Madison, WI. Plasmid pFC80,
described below, has been described in the WIPO International Patent Application
No. WO 90/02800, except that the bFGF coding sequence in the plasmid designated
pFC80 herein has the sequence set forth as SEQ ID NO. 52, nucleotides 1-465. The
plasmids described herein may be prepared using pFC80 as a starting material or,
alternatively, by starting with a fragment containing the CII ribosome binding site (SEQ
ID NO. 61) linked to the FGF-encoding DNA (SEQ ID NO. 52).

E. coli strain JA221 (lpp- hdsM+ trpE5 leuB6 lacY recA1 F'[lacIq lac+
pro+]) is publicly available from the American Type Culture Collection (ATCC,
Rockville, MD) [...].

B. Construction of plasmids encoding FGF/nucleic acid binding domain fusion proteins

1. Construction of FGFM13 that contains DNA encoding the CII ribosome
binding site linked to FGF-encoding DNA

A Nco I restriction site is introduced into the nucleic acid binding
domain-encoding DNA by site-directed mutagenesis using the Amersham in vitro
mutagenesis system 2.1. The oligonucleotide employed to create the Nco I restriction
site is synthesized using a 380B automatic DNA synthesizer (Applied Biosystems). The
oligonucleotide containing the Nco I site replaces the original nucleic acid binding
domain-containing coding sequence.

In order to produce a bFGF coding sequence in which the stop codon
is removed, the FGF-encoding DNA is subcloned into an M13 phage and subjected to
site-directed mutagenesis. Plasmid pFC80 is a derivative of pDS20 (see, e.g., Duester
et al., Cell 30:855-864, 1982; see also U.S. Patent Nos. 4,914,027, 5,037,744,
5,100,784, and 5,187,261; see also PCT International Application No. WO 90/01550;
and European Patent Application No. EP 267703 A1), which is almost the same as
plasmid pKG1800 (see Bernardi et al., DNA Sequence 1:147-150, 1990; see also
McKenney et al. (1981) pp. 383-415 in Gene Amplification and Analysis 2: Analysis of
Nucleic Acids by Enzymatic Methods, Chirikjian et al. (eds.), North Holland Publishing
Company, Amsterdam) except that it contains an extra 440 bp at the distal end of galK
between nucleotides 2440 and 2880 in pDS20. Plasmid pKG1800 includes the 2880 bp
EcoR I-Pvu II fragment of pBR322 that contains the ampicillin resistance gene and
an origin of replication.
Plasmid pFC80 is prepared from pDS20 by replacing the entire galK
gene with the FGF-encoding DNA of SEQ ID NO. 52, inserting the trp promoter (SEQ
ID NO. 62) and the bacteriophage lambda cII ribosome binding site (SEQ ID NO. 61;
see, e.g., Schwarz et al., Nature 272:410, 1978) upstream of and operatively linked to
the FGF-encoding DNA. The Trp promoter can be obtained from plasmid pDR720
(Pharmacia PL Biochemicals) or synthesized according to SEQ ID NO. 62. Plasmid
pFC80 contains the 2880 bp EcoR I-BamH I fragment of plasmid pDS20, a synthetic
Sal I-Nde I fragment that encodes the Trp promoter region:



EcoR I
AATTCCCCTGTTGACAATTAATCATCGAACTAGTTAACTAGTACGCAGCTTGGCTGCAG

and the cII ribosome binding site (SEQ ID NO. 61):

Sal I                                                Nde I
GTCGACCAAGCTTGGGCATACATTCAATCAATTGTTATCTAAGGAAATACTTACATATG

The FGF-encoding DNA is removed from pFC80 by treating it as
follows. The pFC80 plasmid is digested by Hga I and Sal I, which produces a
fragment containing the CII ribosome binding site linked to the FGF-encoding DNA.
The resulting fragment is blunt ended with Klenow's reagent and inserted into
M13mp18 that has been opened by Sma I and treated with alkaline phosphatase for
blunt-end ligation. In order to remove the stop codon, an insert in the ORI minus
direction is mutagenized using the Amersham kit, as described above, using the
following oligonucleotide (SEQ ID NO. 63): GCTAAGAGCGCCATGGAGA, which
contains one nucleotide between the FGF carboxy terminal serine codon and a Nco I
restriction site; it replaces the following wild type FGF encoding DNA having SEQ ID
NO. 64:



GCT AAG AGC TGA CCA TGG AGA 
Ala Lys Ser STOP Pro Trp Arg 
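
The effect of the mutagenic oligonucleotide can be checked directly from the two sequences given above. The sketch below is only an illustrative consistency check (the helper names are not part of the described method): it confirms that the wild-type sequence carries an in-frame TGA stop codon after the serine codon, that the mutagenic oligonucleotide has no in-frame stop, and that exactly one nucleotide separates the serine codon from the Nco I site (CCATGG).

```python
NCO_I = "CCATGG"
STOP_CODONS = {"TAA", "TAG", "TGA"}

wild_type = "GCTAAGAGCTGACCATGGAGA"  # SEQ ID NO. 64 as printed above
mutagenic = "GCTAAGAGCGCCATGGAGA"    # SEQ ID NO. 63 as printed above


def codons(seq):
    """Split a sequence into complete in-frame codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def first_stop(seq):
    """Index of the first in-frame stop codon, or None if there is none."""
    for index, codon in enumerate(codons(seq)):
        if codon in STOP_CODONS:
            return index
    return None


SER_CODON_END = 9  # nucleotides 0-8 encode Ala-Lys-Ser in both sequences
spacer = mutagenic.find(NCO_I) - SER_CODON_END
```

Here `first_stop(wild_type)` identifies the fourth codon (TGA), `first_stop(mutagenic)` finds none, and `spacer` is the single nucleotide mentioned in the text.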

The resulting mutant derivative of M13mp18, lacking a native stop
codon after the carboxy terminal serine codon of bFGF, was designated FGFM13. The
mutagenized region of FGFM13 contained the correct sequence (SEQ ID NO. 65).






2. Preparation of a plasmid that encodes the FGF/myoD fusion

Plasmid FGFM13 is cut with Nco I and Sac I to yield a fragment
containing the CII ribosome binding site linked to the bFGF coding sequence with the
stop codon replaced.

An M13mp18 derivative containing the myoD coding sequence is also
cut with restriction endonucleases Nco I and Sac I, and the bFGF coding fragment from
FGFM13 is inserted by ligation to DNA encoding the fusion protein bFGF-myoD
into the M13mp18 derivative to produce mpFGF-myoD, which contains the CII
ribosome binding site linked to the FGF-nucleic acid binding domain fusion gene.

Plasmid mpFGF-myoD is digested with Xba I and EcoR I and the
resulting fragment containing the bFGF-myoD coding sequence is isolated and ligated
into plasmid pET-11a (available from NOVAGEN, Madison, WI; for a description of
the plasmids see U.S. Patent No. 4,952,496; see also Studier et al., Meth. Enz. 185:60-89,
1990; Studier et al., J. Mol. Biol. 189:113-130, 1986; Rosenberg et al., Gene
56:125-135, 1987) that has also been treated with EcoR I and Xba I.

E. coli strain BL21(DE3)pLysS (NOVAGEN, Madison, WI) may be
transformed with the plasmid containing the fusion gene.

Plasmid FGF/myoD may be digested with EcoR I, the ends repaired by
adding nucleoside triphosphates and Klenow DNA polymerase, and then digested with
Nde I to release the FGF-encoding DNA without the CII ribosome binding site. This
fragment is ligated into pET-11a, which is BamH I digested, treated to repair the ends
and digested with Nde I. The resulting plasmid includes the T7 transcription terminator
and the pET-11a ribosome binding site.

Plasmid FGF/myoD may be digested with EcoR I and Nde I to release
the FGF-encoding DNA without the CII ribosome binding site and ends are repaired as
described above. This fragment may be ligated into pET-12a, which had been BamH I
digested and treated to repair the ends. The resulting plasmid includes DNA encoding
the OMP T secretion signal operatively linked to DNA encoding the fusion protein.

3. Preparation of a plasmid that encodes FGF2-protamine fusion protein

Protamines are small basic DNA binding proteins, approximately 6.8 kD
in molecular weight with an isoelectric point of 12.175. Twenty-four of the fifty-one
amino acids are strongly basic. Human protamine has been shown to condense genomic
DNA for packaging into the sperm head. The positive charges of the protamine react
with the negative charges of the phosphate backbone of the DNA.

An FGF-protamine fusion protein that has the ability to bind to the FGF
receptor and bind DNA with high affinity is constructed for expression in E. coli. The
sequence for the human protamine gene is obtained from GenBank (accession no.
Y00443). Four overlapping oligonucleotides (60mers) are generated and used to
amplify the protamine gene. The amplified product is purified and ligated into the
bacterial expression vector pET11a (Novagen). To facilitate subcloning, Nco I and
BamH I sites are incorporated into the primers. The fragment is synthesized by annealing
the 4 oligos (2 sense and 2 antisense) with 20 base overlaps and using PCR to fill-in and
amplify the fragments. The PCR products are digested with Nco I and BamH I, and
subcloned into pBluescript SK+. The insert sequence is verified. The sequenced
product is then cloned downstream and in-frame with FGF2, which has been previously
cloned into the pET11a expression plasmid. The oligos used to generate fragment A are
(5'-3'):

PT1:

TACATGCCATGGCCAGGTACAGATGCTGTCGCAGCCAGAGCCGGAGCAGAT
ATTACCGCC (SEQ ID NO.: 85)

PT2:

GCAGCTCCGCCTCCTTCGTCTGCGACTTCTTTGTCTCTGGCGGTAATATCTGC
TCCGGCT (SEQ ID NO.: 86)

PT3:

GACGAAGGAGGCGGAGCTGCCAGACACGGAGGAGAGCCATGAGGTGCTGC
CGCCCCAGGT (SEQ ID NO.: 87)

PT4:

ATATATCCTAGGTTAGTGTCTTCTACATCTCGGTCTGTACCTGGGGCGGCAG
CACCTCA (SEQ ID NO.: 88)
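
The 20-base overlaps between adjacent sense and antisense oligos can be checked by reverse-complementing one oligo and comparing it with its neighbor. The sketch below is a consistency check only. It uses PT1 and PT3 as printed above together with the first 20 nucleotides of PT2; the helper names are illustrative and are not part of the described procedure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def shared_overlap(antisense_5prime, sense):
    """Length of the sense-strand prefix matched by the antisense 5' end."""
    target = reverse_complement(antisense_5prime)
    matched = 0
    while (matched < min(len(target), len(sense))
           and target[matched] == sense[matched]):
        matched += 1
    return matched


PT1 = "TACATGCCATGGCCAGGTACAGATGCTGTCGCAGCCAGAGCCGGAGCAGATATTACCGCC"
PT2_5PRIME = "GCAGCTCCGCCTCCTTCGTC"  # first 20 nt of PT2 (antisense)
PT3 = "GACGAAGGAGGCGGAGCTGCCAGACACGGAGGAGAGCCATGAGGTGCTGCCGCCCCAGGT"

overlap_pt2_pt3 = shared_overlap(PT2_5PRIME, PT3)
nco_site_in_pt1 = PT1.find("CCATGG")
```

The same comparison can be applied to any other pair of adjacent oligos once their full sequences are confirmed against the sequence listing.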

Cells of the expression strain, BL21(DE3), are transformed with the pET11a
construct [...] 100 µg/ml ampicillin. A glycerol stock made from an individual colony
is added to fresh LB broth and then to 250 ml of LB broth. The cells are grown to an
OD600 of 0.7 and induced with IPTG. The culture is harvested 4 hours after induction.
The suspension is centrifuged, the supernatant is saved and the pellet is resuspended in
lysis buffer, centrifuged again and the supernatants pooled. A sample of the pellet and the
supernatant are analyzed by Western analysis using antibodies to FGF2 to determine the
percentage of fusion protein within each fraction. Soluble protein is purified. Briefly,
the cells are pelleted and resuspended in buffer A (10 mM sodium phosphate, pH 6.0,
containing 10 mM EDTA, 10 mM EGTA and 50 mM NaCl) and passed through a
microfluidizer (Microfluidics Corp., Newton, MA) to break open the bacteria and shear
DNA. The resultant mixture is diluted and loaded onto an expanded bed Streamline SP
cation-exchange resin. The column is washed with step gradients of increasing
concentrations of NaCl. The eluted material is analyzed by Western analysis for
fractions containing the fusion protein. These fractions are pooled, diluted, and loaded
onto a Heparin-Sepharose affinity column. After washing, the bound proteins are eluted
in a batch-wise manner in buffer containing 1 M NaCl and then in buffer containing 2
M NaCl. Peak fractions of the 2 M elution, as determined by optical density at 280 nm,
are pooled and the purity determined by gel electrophoresis and Western analysis. The
final pool of material is loaded onto a column of Sephacryl S-100 equilibrated
with 20 mM HEPES pH 7.4, 150 mM NaCl.

Fusion protein located in the pellet is isolated, solubilized and refolded.
Briefly, each culture pellet is thawed completely and resuspended in buffer A (10 mM
Tris, 1 mM EDTA, pH 8.0 + 0.1 mg/ml lysozyme). The mixture is sonicated on ice,
centrifuged at 16,000 X g, and the supernatant discarded. Inclusion bodies are
solubilized with solubilization buffer (6 M guanidine-HCl, 100 mM Tris, 150 mM
NaCl, 50 mM EDTA, 50 mM EGTA, pH 9.5), vortexed, incubated for 30 minutes at
room temperature, and centrifuged at 35,000 X g for 15 minutes. The supernatant is
saved and diluted 1:10 in dilution buffer (100 mM Tris, 10 mM EDTA, 1%
monothioglycerol, 0.25 M L-arginine, pH 9.5). The material is stirred, covered, at 4°C
for 2 hours and then centrifuged at 35,000 X g for 20 minutes. The supernatant is
dialyzed against 5 liters PBS, pH 8.8, for 24 hours at 4°C with 3 changes of fresh
PBS. The material is concentrated approximately 10-fold using size-exclusion spin
columns. The soluble refolded material is then analyzed by gel electrophoresis.

Expression of the FGF-protamine fusion protein can be achieved in
mammalian cells by excising the insert with restriction enzymes Nde I and BamH I and
ligating into a mammalian expression vector.

C. Expression of the recombinant bFGF-nucleic acid binding domain fusion
proteins

A two-stage method is used to produce recombinant bFGF-myoD protein
(hereinafter bFGF-nucleic acid binding domain fusion protein).

Three liters of LB broth containing ampicillin (50 µg/ml) and
chloramphenicol (25 µg/ml) are inoculated with pFS92 plasmid-containing bacterial
cells (strain BL21(DE3)pLysS) from an overnight culture (1:100 dilution). Cells are
grown at 37°C in an incubator shaker to an OD600 of 0.7. IPTG (Sigma Chemical,
St. Louis, MO) is added to a final concentration of [...]
for 1.5 hours, at which time cells are centrifuged.

Experiments have shown that growing BL21(DE3)pLysS cells at 30°C
instead of 37°C improves yields. Thus, cells are grown at 30°C to an OD600 of 1.5 prior
to induction. Following induction, growth is continued for about 2 to 2.5 hours, at
which time the cells are harvested by centrifugation.

The pellet is resuspended in lysis solution (45-60 ml per 16 g of pellet;
20 mM TRIS, pH [...], 5 mM EDTA, 10% sucrose, 130 mM NaCl, lysozyme,
[...] µg/ml, aprotinin, 10 µg/ml, leupeptin, 10 µg/ml, pepstatin A, 10 µg/ml and 1 mM PMSF) and
incubated with stirring for 1 hour at room temperature. The solution is frozen and
thawed three times and sonicated for 2.5 minutes. The suspension is centrifuged at
12,000 X g for [...]; the resulting first supernatant is saved and the pellet is resuspended
in another volume of lysis solution without lysozyme. The resuspended material is
[...] and the two supernatants are pooled
and dialyzed against borate buffered saline, pH 8.3.

D. Affinity purification of bFGF-nucleic acid binding domain fusion protein

Thirty ml of the dialyzed solution containing the bFGF-nucleic acid
binding domain fusion protein from Example 5.C. is applied to a HiTrap
heparin-Sepharose column (Pharmacia, Uppsala, Sweden) equilibrated with 0.15 M
NaCl in 10 mM TRIS, pH 7.4 (buffer A). The column is washed first with equilibration
buffer [...]
by the ELISA method.

bFGF-nucleic acid binding domain fusion protein elutes from the
heparin-Sepharose column at the same concentration (2 M NaCl) as native and
recombinantly-produced bFGF, indicating that the heparin affinity is retained in the
bFGF-SAP fusion protein.
E. Characterization of the bFGF-nucleic acid binding domain fusion protein by
Western blot

SDS gel electrophoresis is performed on a PhastSystem utilizing 20%
acrylamide gels (Pharmacia). Western blotting is accomplished by transfer of the
electrophoresed protein to nitrocellulose using the PhastTransfer system (Pharmacia), as
described by the manufacturer. Antisera to bFGF is used at a dilution of 1:1000.
Horseradish peroxidase labeled anti-IgG is used as the second antibody (Davis et al.,
Basic Methods in Molecular Biology, New York, Elsevier Science Publishing Co.,
pp 1-338, 1986).

Anti-FGF antisera should bind to a protein with an approximate
molecular weight of 53,000, which corresponds to the sum of the independent
molecular weights of nucleic acid binding domain (35,000) and bFGF (18,000).



EXAMPLE 7

Preparation of FGF-Nucleic Acid Binding Domain Conjugates That Contain
Linkers Encoding Protease Substrates

A. Synthesis of oligos encoding protease substrates

Complementary single-stranded oligos, in which the sense strand encodes
a protease substrate, have been synthesized either using a Cyclone machine (Millipore,
MA) according to the instructions provided by the manufacturer, or were made by
Midland Certified Reagent Co. (Midland, TX) or by National Biosciences, Inc. (MN).
The following oligos have been synthesized.

1. Cathepsin B substrate linker

5'- CCATGGCCCTGGCCCTGGCCCTGGCCCTGGCCATGG -3' SEQ ID NO: 66

2. Cathepsin D substrate linker

5'- CCATGGGCCGATCGGGCTTCCTGGGCTTCGGCTTCCTGG
GCTTCGCCATGG -3' SEQ ID NO: 67

3. Trypsin substrate linker

5'- CCATGGGCCGATCGGGCGGTGGGTGCGCTGGTAATAGAGT
CAGAAGATCAGTCGGAAGCAGCCTGTCTTGCGGTGGTCTC
GACCTGCAGGCCATGG -3' SEQ ID NO: 68

4. Gly4Ser

5'- CCATGGGCGGCGGCGGCTCTGCCATGG -3' SEQ ID NO: 47

5. (Gly4Ser)2

5'- CCATGGGCGGCGGCGGCTCTGGCGGCGGCGGCTC
TGCCATGG -3' SEQ ID NO: 48

6. (Ser4Gly)4

5'- CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTC
GGGCTCGTCGTCGTCGGGCTCGTCGTCGTCGGGC
GCCATGG -3' SEQ ID NO: 49

7. (Ser4Gly)2

5'- CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTC
GGGCGCCATGG -3' SEQ ID NO: 50

8. Thrombin substrate linker
CTG GTG CCG CGC GGC AGC SEQ ID NO. 69
Leu Val Pro Arg Gly Ser

9. Enterokinase substrate linker
GAC GAC GAC GAC CCA SEQ ID NO. 70
Asp Asp Asp Asp Lys

10. Factor Xa substrate
ATC GAA GGT CGT SEQ ID NO. 71
Ile Glu Gly Arg
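
The codon-by-codon correspondence shown for the thrombin and Factor Xa linkers can be reproduced with a small translation helper. This is a minimal sketch using the standard genetic code; the helper names are illustrative, and the Gly4Ser core used in the last line is the coding portion between the flanking Nco I sites of the Gly4Ser oligo listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (one-letter amino acids).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA (spaces allowed) into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


thrombin = translate("CTG GTG CCG CGC GGC AGC")  # SEQ ID NO. 69
factor_xa = translate("ATC GAA GGT CGT")         # SEQ ID NO. 71
gly4ser = translate("GGCGGCGGCGGCTCT")           # core of the Gly4Ser linker
```

The results, in one-letter code, correspond to Leu-Val-Pro-Arg-Gly-Ser, Ile-Glu-Gly-Arg and Gly-Gly-Gly-Gly-Ser respectively.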

The complementary oligos are annealed by heating at 95°C for 15 min,
cooled to room temperature, and then incubated at 4°C for a minute to about an hour.
Following incubation, the oligos are digested with Nco I and ligated overnight at a 3:1
(insert:vector) ratio at 15°C to Nco I-digested plasmid which has been treated with
alkaline phosphatase (Boehringer Mannheim).

Bacteria (Novablue (NOVAGEN, Madison, WI)) are transformed with
the ligation mixture (1 µl) and plated on LB-amp or LB-Kan (depending upon the
plasmid). Colonies are selected, clones isolated and sequenced to determine orientation
of the insert. Clones with correct orientation are used to transform expression
strain BL21(DE3) (NOVAGEN, Madison, WI). Glycerol stocks are generated from
single transformed colonies. The transformed strains are cultured as described in
Example 2 and fusion proteins with linkers are expressed.

The DNA and amino acid sequences of exemplary fusion proteins,
containing cathepsin B substrate (FPFS9), cathepsin D substrate (FPFS5), Gly4Ser
(FPFS7), (Gly4Ser)2 (FPFS8), trypsin substrate (FPFS6), (Ser4Gly)4 (FPFS12) and
(Ser4Gly)2 (FPFS11) linkers, respectively, are set forth in SEQ ID NOs. 72-78.

EXAMPLE 8 

FGF-Poly-L-Lysine (FGF2-K) Complexed With a
Plasmid Encoding β-Galactosidase

A. Derivatization of poly-L-lysine

Polylysine polymers with average lengths of 13, 39, 89, 152, and 265
(K13, K39, K89, K152, K265) are purchased from a commercial vendor (Sigma, St. Louis,
MO) and dissolved in 0.1 M NaPO4, 0.1 M NaCl, 1 mM EDTA, pH 7.5 (buffer A) at
3-5 mg/ml. Approximately 30 mg of poly-L-lysine solution is mixed with 0.187 ml of
3 mg/ml N-succinimidyl-3-(pyridyldithio)propionate (SPDP) in anhydrous ethanol,
resulting in a molar ratio of SPDP/poly-L-lysine of 1.5, and incubated at room
temperature for 30 minutes. The reaction mixture is then dialyzed against 4 liters of
buffer A for 4 hours at room temperature.
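
The stated SPDP/poly-L-lysine molar ratio can be related to the quantities given above with a short calculation. This is a rough sketch only: the SPDP molecular weight (312.36 g/mol) is a literature value that is not given in the text, the 30 mg is taken to be polymer mass, and the back-calculated polymer molecular weight is merely what the stated numbers imply, not a measured value.

```python
SPDP_MW = 312.36  # g/mol; assumed literature value, not stated in the Example


def spdp_umol(volume_ml, conc_mg_per_ml, mw=SPDP_MW):
    """Micromoles of SPDP delivered in a given volume of stock solution."""
    return volume_ml * conc_mg_per_ml / mw * 1000.0  # mg / (g/mol) = mmol -> umol


def implied_polymer_kda(polymer_mg, spdp_micromol, molar_ratio):
    """Average polymer mass (kDa) implied by a stated SPDP:polymer molar ratio."""
    polymer_umol = spdp_micromol / molar_ratio
    return polymer_mg / polymer_umol  # mg per umol is numerically kDa


spdp = spdp_umol(volume_ml=0.187, conc_mg_per_ml=3.0)
implied_kda = implied_polymer_kda(polymer_mg=30.0, spdp_micromol=spdp, molar_ratio=1.5)
```

Under these assumptions about 1.8 µmol of SPDP is added, which at a ratio of 1.5 implies roughly 1.2 µmol of polymer with an average mass near 25 kDa.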

B. Conjugation of derivatized polylysine to FGF2-3

A solution containing 28.5 mg of poly-L-lysine-SPDP is added to 12.9
mg of FGF2-3 ([C96S]-FGF2) in buffer A and incubated overnight at 4°C. The molar
ratio of poly-L-lysine-SPDP/FGF2-3 is approximately 1.5. Following incubation, the
conjugation reaction mixture is applied to a 5 ml Resource S (Pharmacia, Uppsala,
Sweden) column. A gradient of 0.15 M to 2.1 M NaCl in 20 mM NaPO4, 1 mM
EDTA, pH 8.0 (Buffer B) over 24 column volumes is used for elution. The
FGF2-3/poly-L-lysine conjugate, called FGF2-K, is eluted off the column at approximately
1.8-2 M NaCl concentration. Unreacted FGF2-3 is eluted off by 0.5-0.6 M NaCl.

The fractions containing FGF2-K are concentrated and loaded onto a
gel-filtration column (Sephacryl S100) for buffer exchange into 20 mM HEPES, 0.1 M
NaCl, pH 7.3. The molecular weight of FGF-K152 as determined by size exclusion
HPLC is approximately 42 kD. To determine if the conjugation procedure interferes
with the ability of FGF2-3 to bind heparin, the chemical conjugate FGF2-K is loaded
onto a heparin column and eluted off the column at 1.8-2.0 M NaCl. In comparison,
unconjugated FGF2-3 is eluted off heparin at 1.4-1.6 M NaCl. This suggests that
poly-L-lysine contributes to the ability of FGF2-3 to bind heparin. The ability of
poly-L-lysine 152 to bind heparin is not determined; poly-L-lysine 84 elutes at approximately
1.6 M NaCl. Histone H1-polylysine was purchased and cytochrome C was conjugated
to polylysine as described herein.

A sample of FGF2-K is electrophoresed on SDS-PAGE under
non-reducing and reducing conditions. The protein migrates at the same molecular weight
as FGF. Under non-reducing conditions the conjugate does not enter the gel because of
its high charge density (Figure 1, lanes 1, 2, non-reducing; lanes 3, 4, reducing).

A standard proliferation assay using aortic bovine endothelial cells is
performed to determine if the conjugation procedure reduced the ability of FGF2-3
to stimulate mitogenesis. The results reveal that FGF2-K is equivalent to FGF2-3
in stimulating proliferation (Figure 2).

C. FGF2-3-poly-L-lysine-nucleic acid complex formation

Optimal conditions for complex formation are established. Varying
quantities (0.2 to 200 µg) of β-galactosidase encoding plasmid nucleic acid pSVβ or
pNASSβ (lacking a promoter) are slowly mixed with 100 µg of FGF2-K in 20 mM
HEPES pH 7.3, 0.15 M NaCl. The reaction is incubated for 1 hour at room
temperature. Nucleic acid binding to the FGF-lysine conjugate is confirmed by gel
mobility shift assay using 32P-labeled SV40-β-gal nucleic acid cut with HincII
restriction endonuclease. In brief, SV40-β-gal nucleic acid is digested with HincII
restriction endonuclease; ends are labeled by T4 PNK following dephosphorylation
with calf intestinal alkaline phosphatase. To each sample of 35 ng of 32P-labeled
nucleic acid, increasing amounts of FGF-polylysine conjugate are added.
The protein/nucleic acid mixture is electrophoresed in an agarose gel with 1X TAE
buffer. Binding of the conjugate to the radiolabeled DNA is shown by a shift in the
complex to the top of the well (Figure 3). As seen in Figure 3D, as little as 10 ng of
K84 causes a complete shift of restriction fragments, indicating binding. With K[...], 100
ng of poly-L-lysine was required (Figure 3C). With K[...], 10 ng was required
(Figure 3E).

The optimal length of poly-L-lysine and weight ratios is determined by
conjugation of FGF2-3 to poly-lysine of different lengths. DNA encoding
β-galactosidase was complexed with the conjugates at 10:1, 5:1, 2:1, 1:1, and 0.5:1
(Figure 4, lanes 1-5, respectively) (w/w) ratios. The ability of these FGF2-K complexes
to bind DNA was determined by measuring the ability of FGF to promote the uptake of
plasmid DNA into cells. FGF2-K conjugates were evaluated at various protein to DNA
ratios for their ability to deliver pSVβ-gal DNA into cells (Figure 4).

Briefly, the complexes were incubated for 1 hr at room temperature and
then added to COS cells for 48 hrs. Cell extracts were prepared and assayed for β-gal
enzyme activity. Briefly, cells are washed with 1 ml of PBS (Ca2+ and Mg2+ free) and
lysed. The lysate was vortexed and cell debris removed by centrifugation. The lysate
was assayed for β-gal activity as recommended by the manufacturer (Promega,
Madison, WI). The β-gal activity was normalized to total protein. As seen in Figure 4,
lane 3, a 2:1 (w/w) ratio of FGF2-K:DNA gave maximal enzyme activity.
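
The protein:DNA weight ratios tested above translate directly into DNA amounts for a fixed quantity of conjugate. A minimal sketch, assuming 100 µg of FGF2-K per complex as in the complex formation step of this Example; the function name is illustrative only.

```python
RATIOS = [10, 5, 2, 1, 0.5]  # protein:DNA (w/w), Figure 4 lanes 1-5


def dna_ug(protein_ug, ratio):
    """Micrograms of DNA needed for a given protein:DNA weight ratio."""
    return protein_ug / ratio


dna_per_ratio = {ratio: dna_ug(100.0, ratio) for ratio in RATIOS}
```

For 100 µg of conjugate this gives 10, 20, 50, 100 and 200 µg of DNA, so the 2:1 ratio that gave maximal activity corresponds to 50 µg of plasmid per 100 µg of FGF2-K.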

In addition, toroid formation, which correlates with increased gene 
expression, was assessed by electron microscopy. A representative toroid at a protein to 
DNA ratio of 2:1 is shown in Figure 5, upper panel. Toroidal structures are absent, or 
only partially formed, at low ratios (e.g., 0.5:1) (Figure 5, lower panel). 

A proliferation assay is performed to determine if the condensed nucleic 
acid had an effect on the ability of FGF2-K to bind to cognate receptor and stimulate 
mitogenesis. The proliferation assay shows that only the highest dose of nucleic acid 
(200 ng) has a slightly inhibitory effect on proliferation as compared to FGF2-3 plus 
poly-L-lysine + DNA (Figure 6). 
An FGF2-K84-DNA complex at a protein:DNA ratio of 2:1 is introduced into COS
cells and an endothelial cell line, ABAE, both of which express FGF receptors. The
cells are subsequently assayed for β-galactosidase enzyme activity. COS and ABAE
cells are grown on coverslips and incubated with the different ratios of FGF2-K:DNA
for 48 hours. The cells are then fixed and stained with X-gal. Maximal β-galactosidase
enzyme activity is seen when 50 µg of pSVβ per 100 µg of FGF2-3-polylysine
conjugate is used.

FGF2-K84-pSVβ-gal at a protein to DNA ratio of 2:1 was added to
various cell lines and incubated for 48 hr. Cell extracts were prepared and assayed for
β-gal activity and total protein. As shown in Figure 7A, COS, B16, NIH3T3, and BHK
cell lines were all able to take up complex and express β-gal.

The expression of β-gal requires FGF2 for targeting into cells. pSVβ or
pNASSβ plasmid DNA was incubated with (Figure 7B, lanes 1, 2) or without (lanes 3,
4) FGF2-K84 for 1 hr at room temperature. Complexes were added to COS cells for 48
hr. Cell extracts were assayed for β-gal activity and normalized to total protein. Only
background β-gal activity was seen unless the plasmid was complexed with FGF2/K84.
Expression of β-gal is seen to be both time and dose-dependent (Figures 7C and 7D).

Sensitivity of the receptor mediated gene delivery system is determined
[...] formation. Increasing amounts of
the FGF2-K/DNA complex are added to cells. 100 µg of FGF2-K was mixed with 50 µg
of pSVβ for 1 hour at room temperature. The COS and endothelial cells are incubated
with increasing amounts of condensed material (0 ng, 1 ng, 10 ng, 100 ng, 1000 ng and
10,000 ng). The cells are incubated for 48 hours and then assayed for
β-galactosidase activity. In addition, cells grown on cover slips are treated with 1000
ng of FGF2-K-DNA for 48 hours, then fixed and stained using X-gal. The β-gal
enzyme assay reveals that with increasing amounts of material there is an increase in
enzyme activity (Figure 7D). Cells incubated with X-gal show blue staining
throughout the cytoplasm in approximately 3% of the cells on the coverslip.

Targeting of the complexes is specific for the FGF receptor. First, as
seen in Figure 8A, FGF2-K84-pSVβ-gal resulted in enzyme activity (lane 1), while only
background levels of activity were seen with FGF2+K84+DNA (lane 2), FGF2+DNA
(lane 3), K84+DNA (lane 4), DNA (lane 5), FGF2-K84 (lane 6), FGF2 alone (lane 7)
and K84 alone (lane 8). The expression of β-gal is specifically inhibited if free FGF2 is
added during transfection (Figure 8B). Moreover, the addition of heparin attenuates the
expression of β-gal (Figure 8C). In addition, histone H1 and cytochrome C were
ineffective in delivering pSVβ-gal (Figure 8C).

Taken together, these findings support the hypothesis that the targeted
DNA is introduced into receptor-bearing cells via the high affinity FGF receptor.
Because histone can bind heparin sulfate yet fails to elicit a signal, the introduction of
DNA appears independent of the low affinity FGF receptor or non-specific endocytosis.

D. Effect of endosome-disruptive peptides 

Targeting is mediated by passage of the complex through endosomes.
Chloroquine, which was added to complexes before transfection, resulted in an 8-fold
increase in β-gal activity (Figure 9A).

Based on this, the effect of endosome disruptive peptides was evaluated.
The peptide INF7, GLFEAIEGFIENGWEGMIDGWYGC, derived from influenza
virus, was synthesized. A complex between FGF2-K84 (5 µg) and pSVβ-gal plasmid
DNA (5 µg) was formed. At this ratio, approximately half of the negative charge of the
DNA was neutralized by the conjugate. K84, poly-L-lysine, was further added to
saturate binding to the remaining DNA. The INF7 peptide was added 30 minutes later.
The complex is added to COS cells and β-gal activity is assayed 48 or 72 hr later.

The amount of free polylysine necessary to neutralize the DNA and
allow INF7 to complex was determined. Polylysine was added at 4, 10, or 25 µg to the
FGF2-K84/pSVβ-gal complex. To each of these complexes four different
concentrations of INF7 were added. Maximal β-gal expression was seen with 4 µg of
K84 and 12 ng of INF7 (Figure 13A). When higher amounts of poly-lysine were used,
more cell death resulted. The optimal amount of INF7 was determined using 4 µg of
polylysine. As seen in Figure 13B, 24 µg of INF7 gave maximal β-gal activity. At 72
hr, 48 µg of INF7 gave maximal β-gal activity (approximately 20-32 fold enhancement)
(Figure 13C).

When an endosome disruptive peptide was included in the complex,
expression of β-gal was increased 26-fold (Figure 9B). Concomitant with this increased
level of expression was an increase in the number of cells expressing β-gal. As seen in
Figure 9C, when endosome disruptive peptide (EDP) was present (right panel), [...]%
of cells express β-gal in comparison to [...]% without EDP added (left panel).



EXAMPLE 9



Cytotoxic Activity of FGF/Poly-L-Lysine 
Bound to SAP DNA Plasmid 

The cytotoxicity assay measures viable cells after transfection with a
cytocide-encoding agent. When FGF-2 is the receptor-binding internalized ligand,
COS7 cells, which express FGFR, may be used as targets, and T47D, which does not
express a receptor for FGF-2 at detectable levels, may be used as negative control cells.

Cells are plated at 38,000 cells/well and 48,000 cells/well in a 12-well
tissue culture plate in RPMI 1640 supplemented with 5% FBS. The complex
FGF2-K/pZ200M (a plasmid which expresses saporin) is incubated with COS7 or T47D cells
for 48 hrs. Controls include FGF2-K alone, pZ200M alone, and FGF-2 plus
poly-L-lysine plus pZ200M. Following incubation, cells are rinsed in PBS lacking
Mg2+ and Ca2+. Trypsin at O.W. is added for 10 min and cells are harvested and
washed. Cell number from each well is determined by a Coulter particle counter (or
equivalent method). A statistically significant decrease in cell number for cells
incubated with FGF2-K/pZ200M compared to FGF2-K or pZ200M alone indicates
sufficient cytotoxicity.
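
The comparison of cell numbers described above can be illustrated with a two-sample test. The following is a minimal sketch, assuming replicate wells per condition and Welch's unequal-variance t statistic as the test; the text does not name a particular statistical method, the function names are illustrative, and the counts shown are hypothetical placeholders rather than data from this example.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t, df) for Welch's unequal-variance two-sample t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each group mean.
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


def percent_decrease(control, treated):
    """Percent drop in mean cell number relative to the control wells."""
    return 100.0 * (1.0 - statistics.mean(treated) / statistics.mean(control))


if __name__ == "__main__":
    # Hypothetical Coulter counts per well (NOT data from this example).
    pz200m_alone = [41000, 39500, 40200]
    fgf2k_pz200m = [15200, 17100, 16000]
    t, df = welch_t(pz200m_alone, fgf2k_pz200m)
    print(f"t = {t:.2f}, df = {df:.1f}")
    print(f"decrease = {percent_decrease(pz200m_alone, fgf2k_pz200m):.1f}%")
```

The t statistic and degrees of freedom would then be compared against a t-distribution table (or a statistics package) at the chosen significance level; the Python standard library used here does not provide that distribution.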

FGF2-polylysine-DNASAP complexes show selective cytotoxicity. To
optimize the expression of the plant RIP, saporin, in mammalian cells, a synthetic
saporin gene was constructed using preferred mammalian codons, and a "Kozak" sequence
was introduced for translation initiation. The synthetic gene was then cloned into SV40 promoter and
promoterless expression vectors. Because the expression of SAP from SAP-encoding
DNA would only be feasible if the mammalian ribosome can synthesize the protein
(SAP) prior to its inactivation by the SAP synthesized, the enzymatic activity of saporin
encoded by the synthetic gene was tested. SAP was cloned into a T7/SP6 promoter
plasmid and sense RNA was generated using T7 RNA polymerase. The RNA was then
added to a mammalian in vitro translation assay. The results from this cell-free in vitro
translation assay clearly show that the saporin expressed in a mammalian system can
inhibit protein synthesis (Figure 10). When added alone to the
lysate, SAP mRNA is translated into a protein that has the anticipated molecular weight
of the saporin protein (lane 2). Similarly, when luciferase mRNA is added to the lysate,
a molecule consistent with the luciferase protein is detected (lane 3). In contrast, if SAP
mRNA is added to the lysate along with or 30 minutes prior to luciferase mRNA,
saporin activity is detected (lanes 4 and 5).
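
The design step described above (back-translating a protein with preferred mammalian codons and placing a Kozak context at the start codon) can be sketched as follows. This is only an illustration under stated assumptions: the codon table is one commonly cited choice of a frequent human codon per residue, the `GCCACC` prefix is the widely used Kozak consensus, and the function names are hypothetical. None of these are taken from this document, and the actual synthetic saporin gene may use different codons and a different initiation context.

```python
# Assumption: one frequently used human codon per amino acid (illustrative only).
PREFERRED_CODON = {
    "A": "GCC", "R": "CGG", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}
KOZAK_PREFIX = "GCCACC"  # assumption: consensus context placed just 5' of ATG
STOP_CODON = "TGA"


def back_translate(protein):
    """Back-translate a one-letter protein string using the table above."""
    return "".join(PREFERRED_CODON[aa] for aa in protein.upper())


def build_orf(mature_protein):
    """Return Kozak prefix + ATG-initiated coding sequence + stop codon."""
    body = mature_protein.upper()
    if not body.startswith("M"):
        body = "M" + body  # add an initiator methionine if none is present
    return KOZAK_PREFIX + back_translate(body) + STOP_CODON


if __name__ == "__main__":
    # First ten residues of mature saporin as given in SEQ ID NO:19.
    print(build_orf("VTSITLDLVN"))
```

A real design would also screen the result for unwanted restriction sites and cryptic splice signals, which this sketch does not attempt.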

Transfection of cells with SAP DNA demonstrates cytotoxicity. When a
mammalian expression vector encoding saporin is transiently expressed in NIH 3T3
cells using CaPO4, there is a >65% decrease in cell survival (lane 3) compared to cells
mock transfected (lane 1) or transfected with DNA encoding β-gal (lane 2) (Figure 11).

To determine whether the FGF2-K can transfer plasmid DNA encoding
SAP into FGF receptor-bearing cells, FGF2-K was condensed with the pSV40-SAP
plasmid DNA at a ratio of 2:1 (w:w). BHK 21 and NIH 3T3 cells were used as the
target cells. The cells (24,000 cells/well) were incubated with either FGF2-K-DNASAP
or an FGF2-K-DNAβ-gal complex. After 72 hours of incubation, cell number was
determined. As shown in Figure 12, there is a significant decrease in cell number when
cells are incubated with the FGF2-K-DNASAP complex compared to cells incubated
with the FGF2-K-DNAβ-gal complex.

From the foregoing it will be appreciated that, although specific
embodiments of the invention have been described herein for purposes of illustration,
various modifications may be made without deviating from the spirit and scope of the
invention. Accordingly, the invention is not limited except as by the appended claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: Prizm Pharmaceuticals, Inc.
(ii) TITLE OF INVENTION: COMPOSITIONS CONTAINING NUCLEIC ACIDS AND LIGANDS FOR THERAPEUTIC TREATMENT

(iii) NUMBER OF SEQUENCES: 106 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY 

(B) STREET: 6300 Columbia Center, 701 Fifth Avenue

(C) CITY: Seattle
(D) STATE: Washington

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 16-MAY-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Nottenburg Ph.D., Carol

(B) REGISTRATION NUMBER: 39,317

(C) REFERENCE/DOCKET NUMBER: 760100.415PC

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (206) 622-4900 
(B) TELEFAX: (206) 682-6031 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 473 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 13..456

(D) OTHER INFORMATION: /product= "VEGF121-encoding DNA"
(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= "leader-encoding sequence"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu



10 



48 



144 



192 



20 25 

SI? CAT CAC GAA GTG GTG AA6 HC ATP 

Met A a Glu Gly Gly Gly Gin Asn His His Glu Val 5al Z Set 

35 40 

-JO tic; ^„ 

60 

fl^ Se^ S?S o^S II S - - ™ - « - no « c„ .0 

65 70 75 

S Sf S S JI? S - - - - - « « 3CC ™ 

85 90 

100 105 

" S 5?^ ^5 ^ i^l f 12 - - ™ S - ™ c« 3.. 
115 120 

H?S J^n' tTs lyl I'' f ' ^ '^■^ '^CA AGA CAA GAA 432 

HIS Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 
130 135 

^JJ Cvl f^'^ o"^^ ''^^ '^^^ TGATGAATGA ATGAGGATCC 47, 

Lys Cys Asp Lys Pro Arg Arg ^^-^ 

145 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: both
(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 13..588

(D) OTHER INFORMATION: /product= "VEGF165-encoding DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= "leader sequence-encoding DNA"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu
1 5 10

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro
15 20 25

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp
45 50 55 60

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240
Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu
80 85 90

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 432
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140

AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA 480
Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln
145 150 155

GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC
Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys
160 165 170



528 



576 



a'S ire n n r'^' .™ AAG 
Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys 
175 180 185 ^ 

CCG AGG CGG TGATGAATGA ATGAGGATCC 605
Pro Arg Arg
190
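
The paired nucleotide and amino acid rows of this listing can be cross-checked by translating the coding sequence from the stated CDS start. The sketch below is an illustration only, assuming the standard genetic code; the function names `translate` and `codon_count` are hypothetical and not part of the listing. It reproduces the first row of SEQ ID NO:2 (CDS starting at nucleotide 13) and the number of codons implied by the location 13..588.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna, cds_start=1):
    """Translate from a 1-based CDS start until a stop codon or sequence end."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(cds_start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        residues.append(THREE_LETTER[aa])
    return residues


def codon_count(start, end):
    """Number of whole codons in an inclusive 1-based CDS location."""
    return (end - start + 1) // 3


if __name__ == "__main__":
    row_1 = "GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT"
    print(" ".join(translate(row_1, cds_start=13)))
    print(codon_count(13, 588))
```

A mismatch between a translated row and the printed residues points to a transcription error in one of the two lines, for example a `Sen` printed where the codon `TCT` encodes Ser.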



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 677 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 13..657

(D) OTHER INFORMATION: /product= "VEGF189-encoding DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= "leader sequence-encoding DNA"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu
1 5 10

lu Uu I In .'I' l^' '"^^ TCC CAG GCT GCA CCC 

Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro 
15 20 25 

Set fu ITu nt rf rf '''' ATG 
Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe Met 

35 40 

Iso SIf Itl ?^ c^^ ™^ '^TC GAG ACC CTG GTG GAC 

Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He Glu Thr Leu Val Asp 

55 60 

ne rf t''^ ""^^ '^''^ "^^^ ATC TTC AAG CCA TCC 

lie Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr lie Phe Lys Pro Ser 
65 70 75 

III SI? ?^ '^^^ '^^ ^^'^ T'^C TGC AAT GAC GAG GGC CTG 

Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 



48 



96 



144 



192 



240 



288

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAG GAT AGA GCA AGA CAA GAA 432
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140

AAA AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG 480
Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys Arg Lys
145 150 155

AAA TCC CGG TAT AAG TCC TGG AGC GTT CCC TGT GGG CCT TGC TCA GAG 528
Lys Ser Arg Tyr Lys Ser Trp Ser Val Pro Cys Gly Pro Cys Ser Glu
160 165 170

CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC 576
Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser
175 180 185

TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC 624
Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn
190 195 200

GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA ATGAGGATCC 677
Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
205 210 215



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 728 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 13..711

(D) OTHER INFORMATION: /product= "VEGF206-encoding DNA"
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= "leader sequence-encoding DNA"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:



144 



192 



240 



288 



336 



GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu

- s s s ;^ s « s s .| s s - « 
s 0^ s 3^ z sji « f : ™ ™ « ™ ™ 

35 40 
^° 55 

Se^ S^. ?| S « - - « - no « c« .c 

70 75 

SI? s s s j» - « « - - «c .c 

85 90 

cS SI? So ?s G^ts r ^^'^ 

vai Pro Thr Glu Glu Sen Asn He Thr Met Gin He Met Arg 

105 

?S S „« .'J; « « « ™ C« «c „0 «C ™ CTA « 38. 

110 ^ , 'l* "'^ ^'^ ^'>' Glu Met Ser Phe Leu G1n 

120 

H?s Z iTs Itl ITu Ifs Tra T ^ 
Lys cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu 

130 135 

155 

180 

200 

s s s s s s: s - 



432 



480

AAC GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA 718
Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
225 230 235

ATGAGGATCC 728

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..627

(D) OTHER INFORMATION: /note= "human HBEGF precursor"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val
1 5 10 15

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 

Gln Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu
50 55 60

Gln Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro
65 70 75 80

Gln Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys
85 90 95

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr
100 105 110

Lys Asp Phe Cys Ile His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg
115 120 125

Ala Pro Ser Cys Ile Cys His Pro Gly Tyr His Gly Glu Arg Cys His
130 135 140

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr
145 150 155 160

Thr Ile Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu
165 170 175

Val Ile Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr
180 185 190

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His
195 200 205

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "human mature HBEGF"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Arg Val Thr Leu Ser Ser Lys Pro Gln Ala Leu Ala Thr Pro Asn Lys
1 5 10 15

Glu Glu His Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys
20 25 30

Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys Ile His Gly Glu
35 40 45

Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys Ile Cys His Pro
50 55 60

Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro
65 70 75

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 208 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "monkey HBEGF precursor"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Leu Leu Ala Ala Val
1 5 10 15

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Gln Leu Arg Arg Gly
20 25 30

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Ser Thr Gly Ser Thr Asp
35 40 45

Gln Leu Leu Arg Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu
50 55 60

Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
65 70 75 80 

Gin Ala Leu Ala Thr Pro Ser Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

Thr Ile Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu
165 170 175

Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note "rat HBEGF precursor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val
1 5 10 15

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly
20 25 30

Leu Ala Ala Ala Thr Ser Asn Pro Asp Pro Pro Thr Gly Thr Thr Asn
35 40 45

Gln Leu Leu Pro Thr Gly Ala Asp Arg Ala Gln Glu Val Gln Asp Leu
50 55 60

Glu Gly Thr Asp Leu Asp Leu Phe Lys Val Ala Phe Ser Ser Lys Pro
65 70 75 80

Gin Ala Leu Ala Thr Pro Gly Lys Glu Lys Asn Gly Lys Lys Lys Arg 
85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Lys Lys Tyr 
100 105 110 

Lys Asp Tyr Cys Ile His Gly Glu Cys Arg Tyr Leu Lys Glu Leu Arg
115 120 125

Ile Pro Ser Cys His Cys Leu Pro Gly Tyr His Gly Gln Arg Cys His
130 135 140

Gly Leu Thr Leu Pro Val Glu Asn Pro Leu Tyr Thr Tyr Asp His Thr
145 150 155 160

Thr Val Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu
165 170 175

Val Ile Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr
180 185 190

Asp Leu Glu Ser Glu Glu Lys Val Lys Leu Gly Met Ala Ser Ser His 
195 200 205 

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 627 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..627

(D) OTHER INFORMATION: /note= "human HBEGF precursor"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

ATG AAG CTG CTG CCG TCG GTG GTG CTG AAG CTC TTT CTG GCT GCA GTT 48
Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val
1 5 10 15

CTC TCG GCA CTG GTG ACT GGC GAG AGC CTG GAG CGG CTT CGG AGA GGG 96
Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly
20 25 30

CTA GCT GCT GGA ACC AGC AAC CCG GAC CCT CCC ACT GTA TCC ACG GAC 144
Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp
35 40 45

CAG CTG CTA CCC CTA GGA GGC GGC CGG GAC CGG AAA GTC CGT GAC TTG 192 
Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 

CAA GAG GCA GAT CTG GAC CTT TTG AGA GTC ACT TTA TCC TCC AAG CCA 240 
Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
65 70 75 80 

CAA GCA CTG GCC ACA CCA AAC AAG GAG GAG CAC GGG AAA AGA AAG AAG 288 
Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

AAA GGC AAG GGG CTA GGG AAG AAG AGG GAC CCA TGT CTT CGG AAA TAC 336 
Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

AAG GAC TTC TGC ATC CAT GGA GAA TGC AAA TAT GTG AAG GAG CTC CGG 384 
Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

GCT CCC TCC TGC ATC TGC CAC CCG GGT TAC CAT GGA GAG AGG TGT CAT 432 
Ala Pro Ser Cys Ile Cys His Pro Gly Tyr His Gly Glu Arg Cys His
130 135 140 

GGG CTG AGC CTC CCA GTG GAA AAT CGC TTA TAT ACC TAT GAC CAC ACA 480 
Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 150 155 160 

ACC ATC CTG GCC GTG GTG GCT GTG GTG CTG TCA TCT GTC TGT CTG CTG 528 
Thr He Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

GTC ATC GTG GGG CTT CTC ATG TTT AGG TAC CAT AGG AGA GGA GGT TAT 576 
Val lie Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

GAT GTG GAA AAT GAA GAG AAA GTG AAG TTG GGC ATG ACT AAT TCC CAC 624 
Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



TGA 627

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Met Ala Glu Gly Glu Ile Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe
1 5 10 15

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 
20 25 30 

Asn Gly Gly His Phe Leu Arg Ile Leu Pro Asp Gly Thr Val Asp Gly
35 40 45

Thr Arg Asp Arg Ser Asp Gln His Ile Gln Leu Gln Leu Ser Ala Glu
50 55 60

Ser Val Gly Glu Val Tyr Ile Lys Ser Thr Glu Thr Gly Gln Tyr Leu
65 70 75 80

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gln Thr Pro Asn Glu
85 90 95

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr
100 105 110

Ile Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys
115 120 125

Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gln Lys Ala
130 135 140

Ile Leu Phe Leu Pro Leu Pro Val Ser Ser Asp
145 150 155

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "FGF-2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 110 

Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser
145 150 155

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-3"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Met Gly Leu Ile Trp Leu Leu Leu Leu Ser Leu Leu Glu Pro Gly Trp
1 5 10 15

Pro Ala Ala Gly Pro Gly Ala Arg Leu Arg Arg Asp Ala Gly Gly Arg 
20 25 30 

Gly Gly Val Tyr Glu His Leu Gly Gly Ala Pro Arg Arg Arg Lys Leu 
35 40 45 

Tyr Cys Ala Thr Lys Tyr His Leu Gin Leu His Pro Ser Gly Arg Val 
50 55 60 

Asn Gly Ser Leu Glu Asn Ser Ala Tyr Ser He Leu Glu He Thr Ala 
65 70 75 80 

Val Glu Val Gly He Val Ala He Arg Gly Leu Phe Ser Gly Arg Tyr 
85 90 95

Leu Ala Met Asn Lys Arg Gly Arg Leu Tyr Ala Ser Glu His Tyr Ser
100 105 110

Ala Glu Cys Glu Phe Val Glu Arg Ile His Glu Leu Gly Tyr Asn Thr
115 120 125

Tyr Ala Ser Arg Leu Tyr Arg Thr Val Ser Ser Thr Pro Gly Ala Arg
130 135 140

Arg Gln Pro Ser Ala Glu Arg Leu Trp Tyr Val Ser Val Asn Gly Lys
145 150 155 160

Gly Arg Pro Arg Arg Gly Phe Lys Thr Arg Arg Thr Gln Lys Ser Ser
165 170 175

Leu Phe Leu Pro Arg Val Leu Asp His Arg Asp His Glu Met Val Arg 
180 185 190 

Gin Leu Gin Ser Gly Leu Pro Arg Pro Pro Gly Lys Gly Val Gin Pro 
195 200 205 

Arg Arg Arg Arg Gin Lys Gin Ser Pro Asp Asn Leu Glu Pro Ser His 
210 215 220 

Val Gin Ala Ser Arg Leu Gly Ser Gin Leu Glu Ala Ser Ala His 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

Met Ser Gly Pro Gly Thr Ala Ala Val Ala Leu Leu Pro Ala Val Leu
1 5 10 15

Leu Ala Leu Leu Ala Pro Trp Ala Gly Arg Gly Gly Ala Ala Ala Pro 
20 25 30 

Thr Ala Pro Asn Gly Thr Leu Glu Ala Glu Leu Glu Arg Arg Trp Glu 
35 40 45 

Ser Leu Val Ala Leu Ser Leu Ala Arg Leu Pro Val Ala Ala Gin Pro 
50 55 60 

Lys Glu Ala Ala Val Gln Ser Gly Ala Gly Asp Tyr Leu Leu Gly Ile
65 70 75 80

Lys Arg Leu Arg Arg Leu Tyr Cys Asn Val Gly Ile Gly Phe His Leu
85 90 95

Gln Ala Leu Pro Asp Gly Arg Ile Gly Gly Ala His Ala Asp Thr Arg
100 105 110

Asp Ser Leu Leu Glu Leu Ser Pro Val Glu Arg Gly Val Val Ser Ile
115 120 125

Phe Gly Val Ala Ser Arg Phe Phe Val Ala Met Ser Ser Lys Gly Lys
130 135 140

Leu Tyr Gly Ser Pro Phe Phe Thr Asp Glu Cys Thr Phe Lys Glu Ile
145 150 155 160

Leu Leu Pro Asn Asn Tyr Asn Ala Tyr Glu Ser Tyr Lys Tyr Pro Gly
165 170 175

Met Phe Ile Ala Leu Ser Lys Asn Gly Lys Thr Lys Lys Gly Asn Arg
180 185 190 

Val Ser Pro Thr Met Lys Val Thr His Phe Leu Pro Arg Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "FGF-5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ser Leu Ser Phe Leu Leu Leu Leu Phe Phe Ser His Leu Ile Leu
1 5 10 15

Ser Ala Trp Ala His Gly Glu Lys Arg Leu Ala Pro Lys Gly Gln Pro
20 25 30 

Gly Pro Ala Ala Thr Asp Arg Asn Pro He Gly Ser Ser Ser Arg Gin 
35 40 45 

Ser Ser Ser Ser Ala Met Ser Ser Ser Ser Ala Ser Ser Ser Pro Ala 
50 55 60 

Ala Ser Leu Gly Ser Gin Gly Ser Gly Leu Glu Gin Ser Ser Phe Gin 
65 70 75 80 

Trp Ser Pro Ser Gly Arg Arg Thr Gly Ser Leu Tyr Cys Arg Val Gly 
85 90 95

Ile Gly Phe His Leu Gln Ile Tyr Pro Asp Gly Lys Val Asn Gly Ser
100 105 110

His Glu Ala Asn Met Leu Ser Val Leu Glu Ile Phe Ala Val Ser Gln
115 120 125

Gly Ile Val Gly Ile Arg Gly Val Phe Ser Asn Lys Phe Leu Ala Met
130 135 140

Ser Lys Lys Gly Lys Leu His Ala Ser Ala Lys Phe Thr Asp Asp Cys
145 150 155 160

Lys Phe Arg Glu Arg Phe Gin Glu Asn Ser Tyr Asn Thr Tyr Ala Ser 
165 170 175 

Ala Ile His Arg Thr Glu Lys Thr Gly Arg Glu Trp Tyr Val Ala Leu
180 185 190

Asn Lys Arg Gly Lys Ala Lys Arg Gly Cys Ser Pro Arg Val Lys Pro
195 200 205

Gln His Ile Ser Thr His Phe Leu Pro Arg Phe Lys Gln Ser Glu Gln
210 215 220

Pro Glu Leu Ser Phe Thr Val Thr Val Pro Glu Lys Lys Asn Pro Pro
225 230 235 240

Ser Pro Ile Lys Ser Lys Ile Pro Leu Ser Ala Pro Arg Lys Asn Thr
245 250 255

Asn Ser Val Lys Tyr Arg Leu Lys Phe Arg Phe Gly
260 265

(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-6"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

Met Ser Arg Gly Ala Gly Arg Leu Gln Gly Thr Leu Trp Ala Leu Val
1 5 10 15

Phe Leu Gly He Leu Val Gly Met Val Val Pro Ser Pro Ala Gly Thr 
20 25 30 

Arg Ala Asn Asn Thr Leu Leu Asp Ser Arg Gly Trp Gly Thr Leu Leu 
35 40 45

Ser Arg Ser Arg Ala Gly Leu Ala Gly Glu Ile Ala Gly Val Asn Trp
50 55 60

Glu Ser Gly Tyr Leu Val Gly Ile Lys Arg Gln Arg Arg Leu Tyr Cys
65 70 75 80

Asn Val Gly Ile Gly Phe His Leu Gln Val Leu Pro Asp Gly Arg Ile
85 90 95

Ser Gly Thr His Glu Glu Asn Pro Tyr Ser Leu Leu Glu Ile Ser Thr
100 105 110

Val Glu Arg Gly Val Val Ser Leu Phe Gly Val Arg Ser Ala Leu Phe
115 120 125

Val Ala Met Asn Ser Lys Gly Arg Leu Tyr Ala Thr Pro Ser Phe Gln
130 135 140

Glu Glu Cys Lys Phe Arg Glu Thr Leu Leu Pro Asn Asn Tyr Asn Ala
145 150 155 160

Tyr Glu Ser Asp Leu Tyr Gln Gly Thr Tyr Ile Ala Leu Ser Lys Tyr
165 170 175

Gly Arg Val Lys Arg Gly Ser Lys Val Ser Pro Ile Met Thr Val Thr
180 185 190

His Phe Leu Pro Arg Ile
195

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "FGF-7"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

Met His Lys Trp Ile Leu Thr Trp Ile Leu Pro Thr Leu Leu Tyr Arg
1 5 10 15

Ser Cys Phe His Ile Ile Cys Leu Val Gly Thr Ile Ser Leu Ala Cys
20 25 30

Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Cys Ser Ser
35 40 45

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp Ile
50 55 60

Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg Ile Asp
65 70 75 80

Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn Tyr Asn
85 90 95

Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile Lys Gly
100 105 110

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr
115 120 125

Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu Ile Leu
130 135 140

Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly
145 150 155 160

Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val Arg Gly
165 170 175

Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro Met Ala
180 185 190

Ile Thr

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 215 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-8"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

Met Gly Ser Pro Arg Ser Ala Leu Ser Cys Leu Leu Leu His Leu Leu
1 5 10 15

Val Leu Cys Leu Gln Ala Gln Val Thr Val Gln Ser Ser Pro Asn Phe
20 25 30

Thr Gln His Val Arg Glu Gln Ser Leu Val Thr Asp Gln Leu Ser Arg
35 40 45

Arg Leu Ile Arg Thr Tyr Gln Leu Tyr Ser Arg Thr Ser Gly Lys His
50 55 60

Val Gln Val Leu Ala Asn Lys Arg Ile Asn Ala Met Ala Glu Asp Gly
65 70 75 80

Asp Pro Phe Ala Lys Leu Ile Val Glu Thr Asp Thr Phe Gly Ser Arg
85 90 95

Val Arg Val Arg Gly Ala Glu Thr Gly Leu Tyr Ile Cys Met Asn Lys
100 105 110

Lys Gly Lys Leu Ile Ala Lys Ser Asn Gly Lys Gly Lys Asp Cys Val
115 120 125

Phe Thr Glu Ile Val Leu Glu Asn Asn Tyr Thr Ala Leu Gln Asn Ala
130 135 140

Lys Tyr Glu Gly Trp Tyr Met Ala Phe Thr Arg Lys Gly Arg Pro Arg
145 150 155 160

Lys Gly Ser Lys Thr Arg Gln His Gln Arg Glu Val His Phe Met Lys
165 170 175

Arg Leu Pro Arg Gly His His Thr Thr Glu Gln Ser Leu Arg Phe Glu
180 185 190

Phe Leu Asn Tyr Pro Pro Phe Thr Arg Ser Leu Arg Gly Ser Gln Arg
195 200 205

Thr Trp Ala Pro Glu Pro Arg
210 215

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "FGF-9"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

Met Ala Pro Leu Gly Glu Val Gly Asn Tyr Phe Gly Val Gln Asp Ala
1 5 10 15

Val Pro Phe Gly Asn Val Pro Val Leu Pro Val Asp Ser Pro Val Leu 
20 25 30 

Leu Ser Asp His Leu Gly Gln Ser Glu Ala Gly Gly Leu Pro Arg Gly
35 40 45

Pro Ala Val Thr Asp Leu Asp His Leu Lys Gly Ile Leu Arg Arg Arg
50 55 60

Gln Leu Tyr Cys Arg Thr Gly Phe His Leu Glu Ile Phe Pro Asn Gly
65 70 75 80

Thr Ile Gln Gly Thr Arg Lys Asp His Ser Arg Phe Gly Ile Leu Glu
85 90 95

Phe Ile Ser Ile Ala Val Gly Leu Val Ser Ile Arg Gly Val Asp Ser
100 105 110

Gly Leu Tyr Leu Gly Met Asn Glu Lys Gly Glu Leu Tyr Gly Ser Glu
115 120 125

Lys Leu Thr Gln Glu Cys Val Phe Arg Glu Gln Phe Glu Glu Asn Trp
130 135 140

Tyr Asn Thr Tyr Ser Ser Asn Leu Tyr Lys His Val Asp Thr Gly Arg
145 150 155 160

Arg Tyr Tyr Val Ala Leu Asn Lys Asp Gly Thr Pro Arg Glu Gly Thr
165 170 175

Arg Thr Lys Arg His Gln Lys Phe Thr His Phe Leu Pro Arg Pro Val
180 185 190

Asp Pro Asp Lys Val Pro Glu Leu Tyr Lys Asp Ile Leu Ser Gln Ser
195 200 205

(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G4 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAT GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile 
115 120 125 

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480 
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu 
130 135 140 145 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence corresponding to the clone M13 mp18-G1 in Example I.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46..804 

(D) OTHER INFORMATION: /product= "Saporin" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile 
115 120 125 

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480 
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu 
130 135 140 145 

CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT CGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGA GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence corresponding to the clone M13 mp18-G2 in Example I.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46..804 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

t'pd ?Ip f ^ II" ™ ACA ACT GAT GCG GTC 

Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 

.5 ^ 

l^^ ™ ^"^^ AAT CCG ACT GCG GGT CAA TAC TCA 

Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser 

III SIf Z ul f ' ^TG AAA 144 

Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 

25 30 

^fr I z z s 51? ??: ^1 s s s; ^ 2; 
s ,-5 f ;s s s i s; s s - 

60 65 

fT. ?^ '^TC GCG TAT CTT GCA ATG GAT AAC 28ft 

Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala 

90 95 

=^ s s s; z i?. r s z s » 
s o^r ;^ r s ^ « - « ™ «^ 

120 125 

T^r no r?^ f ^ '^^^ CTC GGG TTG GGG ATC GAC TTA 480 

Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu 
130 135 140 145 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTG CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence corresponding to the clone M13 mp18-G7 in Example I.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46..804 

(D) OTHER INFORMATION: /product= "Saporin" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

jfr G? V SI T^r n'' f.' '''' TCT AAA GAA AAA 

Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

fJ] f ^ ^'^^ TTC AGA TCA GAA ATT ACT TCC GCC 

Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala 
100 105 110 

L™ ITu 'jfr THr S'^' r'^' '^^ ^^C CAG ATA 

Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile 

115 120 125 

S nn f ^ J^* ^ ™ 2GG ATC GAC TTA 

Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu 
130 135 140 145 

fH. l'^^ ^^'^ ^'^^ AAG AAG GCA CGT GTG GH AAA 

Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 

150 155 160 

Z ^, SI r 11^ .^^ '^"^'^ ^CT ATT CAG ATG ACG GCT GAG GCA 
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Ala 
165 170 175 

f^t JT '^^^ ^AA AAC HG GTA ATC AAG AAC TTT CCC AAC 

Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Ile Lys Asn Phe Pro Asn 
180 185 190 

fis JJp l'^'^ '^^'^ 2AG GTJ AAC TGG AAA 

Lys Phe Asn Ser Glu Asn Lys Val Ile Gln Phe Glu Val Asn Trp Lys 
195 200 205 

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence corresponding to the clone M13 mp18-G9 in Example I.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46..804 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser 
5 10 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly 
50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

f ^ '^'^^ AGA TCA GAA ATT ACT TCC GCC 336 

Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATT GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile 
115 120 125 

ACA CAA GGA GAT CAA AGT AGA AAA GAA CTC GGG TTG GGG ATT GAC TTA 
Thr Gln Gly Asp Gln Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu 
130 135 140 145 

CTT TCA ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 
Leu Ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

GAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT GAG GCA 
Asp Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Ala 
165 170 175 

GCG CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Ile Lys Asn Phe Pro Asn 
180 185 190 

VJ^ "^^^ ^'^ ^ "^^^ ^'^'^ CAG TTT GAG GTJ AAC TGG AAA 672 
Lys Phe Asn Ser Glu Asn Lys Val Ile Gln Phe Glu Val Asn Trp Lys 
195 200 205 

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Pro Lys Lys Arg Lys Val Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Pro Pro Lys Lys Ala Arg Glu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Pro Ala Ala Lys Arg Val Lys Leu Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Lys Arg Pro Arg Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Lys Ile Pro Ile Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Gly Lys Arg Lys Arg Lys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ser Lys Arg Val Ala Lys Arg Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ser His Trp Lys Gln Lys Arg Lys Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Pro Leu Leu Lys Lys Ile Lys Gln 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Pro Gln Pro Lys Lys Lys Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Pro Gly Lys Arg Lys Lys Glu Met Thr Lys Gln Lys Glu Val Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..12 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Asn Tyr Lys Lys Pro Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

His Phe Lys Asp Pro Lys Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Pro Arg Arg Arg Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ile Lys Arg Leu Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ile Lys Arg Gln Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ile Arg Val Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "Cytoplasmic Translocation Signal" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Lys Asp Glu Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "Cytoplasmic Translocation Signal" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Arg Asp Glu Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "Cytoplasmic Translocation Signal" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Lys Glu Glu Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "Endosome-disruptive peptide INF" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Leu Phe Glu Ala Ile Glu Gly Phe Ile Glu Asn Gly Trp Glu Gly 
1 5 10 15 

Met Ile Asp Gly Gly Gly Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "Endosome-disruptive peptide INF" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Gly Leu Phe Glu Ala Ile Glu Gly Phe Ile Glu Asn Gly Trp Glu Gly 
1 5 10 15 

Met Ile Asp Gly Trp Tyr Gly Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..26 

(A) NAME/KEY: Gly4Ser with NcoI ends 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCATGGGCGG CGGCGGCTCT GCCATGG 27 
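
As a consistency check on the feature table above, the CDS region of SEQ ID NO: 47 (positions 3..26) can be translated to confirm that it encodes four glycines followed by a serine, flanked by the Met and Ala-Met residues contributed by the NcoI ends. The following is only an illustrative sketch: the `translate` helper and its deliberately partial codon table are assumptions introduced here and are not part of the listing.

```python
# Partial codon table: only the codons that occur in the linker sequences.
CODONS = {"ATG": "Met", "GGC": "Gly", "TCT": "Ser", "TCG": "Ser", "GCC": "Ala"}


def translate(dna: str, start: int, end: int) -> list[str]:
    """Translate the 1-based, inclusive region start..end of dna."""
    region = dna[start - 1:end]
    if len(region) % 3 != 0:
        raise ValueError("region length is not a multiple of three")
    return [CODONS[region[i:i + 3]] for i in range(0, len(region), 3)]


# SEQ ID NO: 47 with the spacing of the listing removed.
SEQ_ID_47 = "CCATGGGCGGCGGCGGCTCTGCCATGG"

linker = translate(SEQ_ID_47, 3, 26)
print(" ".join(linker))
```

The same helper can be pointed at the other NcoI-ended linker sequences in the listing by changing the sequence string and the CDS coordinates.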
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..41 

(A) NAME/KEY: (Gly4Ser)2 with NcoI ends 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG 42 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..74 

(A) NAME/KEY: (Ser4Gly)4 with NcoI ends 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG GGCTCGTCGT 60 
CGTCGGGCGC CATGG 75 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..45 

(A) NAME/KEY: (Ser4Gly)2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG 45 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product= Flexible linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Ala Pro Ala Ala Ala Pro Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..465 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product= "bFGF" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 110 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC 465 
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1230 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product= "bFGF" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 472..1230 

(D) OTHER INFORMATION: /product= "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

S '^^^ ^^"^ ^'^^ CCG GCT CTG CCG GAA GAC GGT 48 
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
1 5 10 15 

n V 21 'l? GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 
20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg 
35 40 45 

vY^ 5Cn rf f^'^ '^^ CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu 
50 55 60 



cTn S ^ f ^ ^'^ '^^^ '^^C ^ GGA GT6 TGT GCT AAC 240 

Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn 
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 110 

Th^ ^'^ ^ TGG TAT GTG GCA TTG AAA 384 

Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

50 7^1 rt^ l""^ ^ ^ CCT GGG CAG AAA 432 
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys 
130 135 140 

51 nt IT ?^ n'^" ^'^^ see ATG GTC ACA TCA 480 
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser 
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528 
Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe 
165 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576 
Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly 
180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu 
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys 
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu 
245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu 
260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln 
275 280 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu 
290 295 300 

ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu 
305 310 315 320 

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008 
Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg 
325 330 335 

TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC 1056 
Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe 
340 345 350 

GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 1104 
Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile 
355 360 365 

TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT 1152 
Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp 
370 375 380 

TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 1200 
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met 
385 390 395 400 

GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 1230 
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
405 410 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6..11 

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site" 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 12..30 

(D) OTHER INFORMATION: /function= "N-terminal extension" /product= "Native saporin signal peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CTGCAGAATT CGCATGGATC CTGCTTCAAT 30 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6..11 

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site" 

(ix) FEATURE: 

(A) NAME/KEY: terminator 

(B) LOCATION: 23..25 

(D) OTHER INFORMATION: /note= "Anti-sense stop codon" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 26..30 

(D) OTHER INFORMATION: /note= "Anti-sense to carboxyl terminus of mature peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CTGCAGAATT CGCCTCGTTT GACTACTTTG 30 
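Because SEQ ID NO:55 is flagged ANTI-SENSE, its "stop codon" feature at 23..25 only reads as a stop codon on the complementary strand. The following sketch (the function name `reverse_complement` is an illustrative helper, not something defined in the listing) reverse-complements that triplet:

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse to read the opposite strand 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

# SEQ ID NO:55 as printed in the listing, with the grouping spaces removed.
seq55 = "CTGCAGAATT CGCCTCGTTT GACTACTTTG".replace(" ", "")

antisense_triplet = seq55[22:25]          # LOCATION 23..25 (1-based, inclusive)
sense_triplet = reverse_complement(antisense_triplet)
print(antisense_triplet, sense_triplet)   # sense strand triplet should be a stop codon
```

On the printed sequence the triplet is CTA, whose reverse complement is TAG, one of the three standard stop codons.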

(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:
AGGAGTGTCT GCTAACC 17 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TTCTAAATCG GTTACCGATG ACTG 24
(2) INFORMATION FOR SEQ ID NO:58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:
CATATGTGTG AGCTACTGTC GCCACCGCTC 30
(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:
GGATCCGAGC ACCTGGTATA TCGGTGGGGG 30
(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:
GGATCCGCCT CGTTTGACTA CTT 23

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(D) OTHER INFORMATION: /product= bacteriophage lambda CII ribosome binding site
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:
GTCGACCAAG CTTGGGCATA CATTCAATCA ATTGTTATCT AAGGAAATAC TTACATATG 59 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(D) OTHER INFORMATION: /product= trp promoter
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
AATTCCCCTG TTGACAATTA ATCATCGAAC TAGTTAACTA GTACGCAGCT TGGCTGCAG 59
(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 11..16

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme recognition site." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /product= "Carboxy terminus of 
mature FGF protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

GCTAAGAGCG CCATGGAGA 19 

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..12

(D) OTHER INFORMATION: /product= "Carboxy terminus of
wild type FGF"

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 13..18

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme recognition site"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

GCT AAG AGC TGACCATGGA GA 21
Ala Lys Ser 
1
(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..96 

(D) OTHER INFORMATION: /product= "pFCFNcoP 

/note= "Equals the plasmid pFCSO with native FGF
stop codon removed."

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 29..34

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme recognition site"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:

CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GAG ATC CGG CTG AAT 48

Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Glu Ile Arg Leu Asn
1 5 10 15 

rf "^^^ f^CC GTC m CAG GAC TCC T6AAATCTT 10? 

Gly Ala Val Leu Tyr Arg Phe Ser Cys Ala Val Phe Gln Asp Ser
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 35 

(A) NAME/KEY: Cathepsin B linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:
CCATGGCCCT GGCCCTGGCC CTGGCCCTGG CCATGG 36
(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 50 

(A) NAME/KEY: Cathepsin D linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:

CCATGGGCCG ATCGGGCTTC CTGGGCTTCG GCTTCCTGGG CTTCGCCATG G 51

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 95 

(A) NAME/KEY: "Trypsin linker" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:
CCATGGGCCG ATCGGGCGGT GGGTGCGCTG GTAATAGAGT CAGAAGATCA GTCGGAAGCA 60
GCCTGTCTTG CGGTGGTCTC GACCTGCAGG CCATGG 96
(2) INFORMATION FOR SEQ ID NO:69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /product= Thrombin substrate linker
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:

CTG GTG CCG CGC GGC AGC 18
Leu Val Pro Arg Gly Ser
1 5 

(2) INFORMATION FOR SEQ ID NO:70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..15

(D) OTHER INFORMATION: /product= Enterokinase substrate linker

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:

GAC GAC GAC GAC CCA 
Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..12

(D) OTHER INFORMATION: /product= Factor Xa substrate

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:

ATC GAA GGT CGT 12
Ile Glu Gly Arg
1 
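The short protease-substrate linkers in SEQ ID NOS:69-71 are listed as codon lines paired with amino acid lines. As a rough sketch, assuming the standard genetic code and one-letter amino acid symbols (the `translate` helper and the table construction are illustrative, not part of the listing), the pairing can be checked by translating each codon line:

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a DNA string (spaces ignored) into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

thrombin = translate("CTG GTG CCG CGC GGC AGC")    # SEQ ID NO:69
enterokinase = translate("GAC GAC GAC GAC CCA")    # SEQ ID NO:70, codons as printed
factor_xa = translate("ATC GAA GGT CGT")           # SEQ ID NO:71
print(thrombin, enterokinase, factor_xa)
```

Under these assumptions, SEQ ID NOS:69 and 71 translate to the listed residues (LVPRGS and IEGR). For SEQ ID NO:70, the fifth codon as it appears in this listing (CCA) translates to Pro rather than the listed Lys, so that codon line may warrant checking against the original filing.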

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1260

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..501

(D) OTHER INFORMATION: /product= "Cathepsin B linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 502..1260
(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC CTG GCC 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Leu Ala
145 150 155 160

CTG GCC CTG GCC CTG GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA 528
Leu Ala Leu Ala Leu Ala Met Val Thr Ser Ile Thr Leu Asp Leu Val
165 170 175

AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC 576
Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn
180 185 190

AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG 624
Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val
195 200 205

ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT 672
Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser
210 215 220

TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG 720
Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val
225 230 235 240

GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC 768
Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr
245 250 255

TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG 816
Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu
260 265 270

GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG 864
Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln
275 280 285

TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA 912
Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys
290 295 300

GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG 960
Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val
305 310 315 320

AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC 1008
Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile
325 330 335

GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC 1056
Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn
340 345 350

TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG 1104
Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val
355 360 365

ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG 1152
Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly
370 375 380

GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA 1200
Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly
385 390 395 400

AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG 1248
Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu
405 410 415

GGC AAA CCA AAG 1260 
Gly Lys Pro Lys 
420 
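The three mat_peptide features of SEQ ID NO:72 should tile the 1260-base CDS without gaps or frame shifts. A small sketch of that bookkeeping follows; the variable and function names are illustrative only, and the coordinates are copied from the feature table for this entry:

```python
# (name, start, end) with 1-based inclusive coordinates from the feature table.
features = [
    ("bFGF", 1, 465),
    ("Cathepsin B linker", 466, 501),
    ("Saporin", 502, 1260),
]
CDS_END = 1260

def codon_count(start: int, end: int) -> int:
    """Number of whole codons in start..end; raises if the span breaks the frame."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"span {start}..{end} is not a multiple of 3")
    return length // 3

counts = {name: codon_count(s, e) for name, s, e in features}
# Each feature must begin on the base right after the previous one ends.
contiguous = all(a[2] + 1 == b[1] for a, b in zip(features, features[1:]))
total_codons = sum(counts.values())
print(counts, contiguous, total_codons, total_codons * 3 == CDS_END)
```

With these coordinates the spans come to 155, 12 and 253 codons, which sum to the 420 residues numbered in the sequence above. The same check applies to the other fusion constructs by substituting their linker coordinates.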

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1275 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1275

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..516

(D) OTHER INFORMATION: /product= "Cathepsin D linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 517..1275

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC CGA TCG 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Arg Ser
145 150 155 160

GGC TTC CTG GGC TTC GGC TTC CTG GGC TTC GCC ATG GTC ACA TCA ATC 528
Gly Phe Leu Gly Phe Gly Phe Leu Gly Phe Ala Met Val Thr Ser Ile
165 170 175

ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG 576
Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val
180 185 190

GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT 624
Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly
195 200 205

ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA 672
Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg
210 215 220

ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC 720
Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg
225 230 235 240

GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT 768
Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val
245 250 255

AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC 816
Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr
260 265 270

GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC 864
Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr
275 280 285

ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA 912
Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly
290 295 300

GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG 960
Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr
305 310 315 320

TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT 1008
Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala
325 330 335

AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT 1056
Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe
340 345 350

AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC 1104
Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp
355 360 365

TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT 1152
Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser
370 375 380

ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT 1200
Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr
385 390 395 400

GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA 1248
Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly
405 410 415

CTC CTT ATG TAT TTG GGC AAA CCA AAG 1275
Leu Leu Met Tyr Leu Gly Lys Pro Lys
420 425

(2) INFORMATION FOR SEQ ID NO:74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1251 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1251

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465
(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..492

(D) OTHER INFORMATION: /product= "Gly4Ser linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 493..1251
(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC GGC GGC 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Gly Gly
145 150 155 160

GGC TCT GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC 528
Gly Ser Ala Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr
165 170 175

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 576
Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys
180 185 190

GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 624
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro
195 200 205

CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 672
Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly
210 215 220

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 720
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr
225 230 235 240

CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA 768
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser
245 250 255

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT 816
Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr
260 265 270

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA 864
Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu
275 280 285

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 912
Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly
290 295 300

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG 960
Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys
305 310 315 320

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA 1008
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln
325 330 335

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT 1056
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr
340 345 350

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT 1104
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe
355 360 365

GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA 1152
Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys
370 375 380

(2) INFORMATION FOR SEQ ID NO:75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1266 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1266

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..507

(D) OTHER INFORMATION: /product= "(Gly4Ser)2 linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 508..1266

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC GGC GGC 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Gly Gly
145 150 155 160

GGC TCT GGC GGC GGC GGC TCT GCC ATG GTC ACA TCA ATC ACA TTA GAT 528
Gly Ser Gly Gly Gly Gly Ser Ala Met Val Thr Ser Ile Thr Leu Asp
165 170 175

CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC 576
Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile
180 185 190

CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA 624
Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile
195 200 205

GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC 672
Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe
210 215 220

CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG 720
Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu
225 230 235 240

TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA 768
Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala
245 250 255

TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC 816
Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe
260 265 270

CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT 864
Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp
275 280 285

TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT 912
Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser
290 295 300

AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA 960
Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu
305 310 315 320

GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG 1008
Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu
325 330 335

CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT 1056
Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile
340 345 350

CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC 1104
Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn
355 360 365

AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA 1152
Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile
370 375 380

TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG 1200
Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly
385 390 395 400

TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG 1248
Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly Leu Leu Met
405 410 415

TAT TTG GGC AAA CCA AAG 1266
Tyr Leu Gly Lys Pro Lys
420

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1320 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1320

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..561

(D) OTHER INFORMATION: /product= "Trypsin linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 562..1320

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GGC CGA TCG 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Gly Arg Ser
145 150 155 160

GGC GGT GGG TGC GCT GGT AAT AGA GTC AGA AGA TCA GTC GGA AGC AGC 528
Gly Gly Gly Cys Ala Gly Asn Arg Val Arg Arg Ser Val Gly Ser Ser
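165 170 175

CTG TCT TGC GGT GGT CTC GAC CTG CAG GCC ATG GTC ACA TCA ATC ACA 576
Leu Ser Cys Gly Gly Leu Asp Leu Gln Ala Met Val Thr Ser Ile Thr
180 185 190

TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT 624
Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp
195 200 205

AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC 672
Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr
210 215 220

GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT 720
Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg Ile
225 230 235 240

AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT 768
Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp
245 250 255

AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT 816
Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn
260 265 270

CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC 864
Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr Ala
275 280 285

CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA 912
Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr Thr
290 295 300

GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT 960
Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly Asp
305 310 315 320

AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC 1008
Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr Phe
325 330 335

ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG 1056
Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg
340 345 350

TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG 1104
Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe Arg
355 360 365

TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG 1152
Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser
370 375 380

GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG 1200
Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser Thr
385 390 395 400

GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT 1248
Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp
405 410 415

TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC 1296
Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly Leu
420 425 430

CTT ATG TAT TTG GGC AAA CCA AAG 1320
Leu Met Tyr Leu Gly Lys Pro Lys
435 440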

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1299 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1299

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 466..540

(D) OTHER INFORMATION: /product= "(Ser4Gly)4 linker"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 541..1299

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC TCG TCG 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Ser Ser
145 150 155 160

TCG TCG GGC TCG TCG TCG TCG GGC TCG TCG TCG TCG GGC TCG TCG TCG 528
Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Ser
165 170 175

TCG GGC GCC ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC 576

Ser Gly Ala Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr
180 185 190

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 624
Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys
195 200 205

GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 672
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro
210 215 220

CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 720
Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly
225 230 235 240

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 768
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr
245 250 255

CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA 816
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser
260 265 270

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT 864
Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr
275 280 285

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA 912
Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu
290 295 300

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 960
Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly
305 310 315 320

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG 1008
Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys
325 330 335

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA 1056
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln
340 345 350

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT 1104
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr
355 360 365

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT 1152
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe
370 375 380

GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA 1200
Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys
385 390 395 400

AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG 1248
Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg
405 410 415

CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA 1296
Gln Val Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro
420 425 430

AAG 1299
Lys


(2) INFORMATION FOR SEQ ID NO:78:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1269 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1269

(ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..465
(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 466..510
(D) OTHER INFORMATION: /product= "(Ser4Gly)2 linker"

(ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 511..1269
(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
20 25 30

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
70 75 80

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
100 105 110

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GCC TCG TCG 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Ala Ser Ser
150 155 160

TCG TCG GGC TCG TCG TCG TCG GGC GCC ATG GTC ACA TCA ATC ACA TTA 528
Ser Ser Gly Ser Ser Ser Ser Gly Ala Met Val Thr Ser Ile Thr Leu
165 170 175

GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA 576
Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys
180 185 190

ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC 624
Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp
195 200 205

ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT 672
Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn
210 215 220

TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC 720
Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn
225 230 235 240

TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG 768
Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg
245 250 255

GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT 816
Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu
260 265 270

TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA 864
Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu
275 280 285

GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA 912
Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys
290 295 300

AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG 960
Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr Phe Met
305 310 315 320

GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT 1008
Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe
325 330 335

CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC 1056
Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe Arg Tyr
340 345 350

ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT 1104
Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp
355 360 365

AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA 1152
Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser Thr Ala
370 375 380

ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC 1200
Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe
385 390 395 400

GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT 1248
Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly Leu Leu
405 410 415

ATG TAT TTG GGC AAA CCA AAG 1269
Met Tyr Leu Gly Lys Pro Lys
420
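
The feature table for SEQ ID NO:78 gives nucleotide coordinates for the bFGF, linker, and saporin segments of the fusion. As an illustrative sketch only, and assuming 1-based inclusive coordinates that are codon-aligned with the start of the CDS, the following Python converts those coordinates into residue ranges. The names `FEATURES` and `residue_range` are hypothetical helpers introduced for this illustration and are not part of the sequence listing.

```python
# Illustrative sketch: map in-frame nucleotide ranges to residue ranges.
# Coordinates are copied from the SEQ ID NO:78 feature table; the helper
# names are hypothetical and exist only for this example.
FEATURES = {
    "bFGF": (1, 465),
    "linker": (466, 510),
    "saporin": (511, 1269),
}


def residue_range(start_nt, end_nt, cds_start=1):
    """Return (first_residue, last_residue) for a 1-based inclusive range."""
    length = end_nt - start_nt + 1
    # Both the offset from the CDS start and the length must be whole codons.
    if (start_nt - cds_start) % 3 or length % 3:
        raise ValueError("feature is not codon-aligned with the CDS")
    first = (start_nt - cds_start) // 3 + 1
    return first, first + length // 3 - 1


RESIDUES = {name: residue_range(*span) for name, span in FEATURES.items()}

for name, (first, last) in RESIDUES.items():
    print(f"{name}: residues {first}-{last}")
```

Under these assumptions the three features tile the 423-codon reading frame without gaps, which is a quick consistency check on the coordinates.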

(2) INFORMATION FOR SEQ ID NO:79:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 765 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..762
(D) OTHER INFORMATION: /product= "Mammalian codon optimized saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:

ATG GTG ACC TCC ATC ACC CTG GAC CTG GTG AAC CCC ACC GCC GGC CAG 48
Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln
1 5 10 15

TAC TCC TCC TTC GTG GAC AAG ATC CGC AAC AAC GTG AAG GAC CCC AAC 96
Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn
20 25 30

CTG AAG TAC GGC GGC ACC GAC ATC GCC GTG ATC GGC CCC CCC TCC AAG 144
Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys
35 40 45

GAG AAG TTC CTG CGC ATC AAC TTC CAG TCC TCC CGC GGC ACC GTG TCC 192
Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser
50 55 60

CTG GGC CTG AAG CGC GAC AAC CTG TAC GTG GTG GCC TAC CTG GCC ATG 240
Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met
65 70 75 80

GAC AAC ACC AAC GTG AAC CGC GCC TAC TAC TTC AAG TCC GAG ATC ACC 288
Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr
85 90 95

TCC GCC GAG CTG ACC GCC CTG TTC CCT GAG GCC ACC ACC GCC AAC CAG 336
Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln
100 105 110

AAG GCC CTG GAG TAC ACC GAG GAC TAC CAG TCC ATC GAG AAG AAC GCC 384
Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala
115 120 125

CAG ATC ACC CAG GGC GAC AAG TCC CGC AAG GAG CTC GGG CTG GGC ATC 432
Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile
130 135 140

GAC CTG CTG CTG ACC TTC ATG GAG GCC GTG AAC AAG AAG GCC CGC GTG 480
Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val
145 150 155 160

GTG AAG AAC GAG GCC CGC TTC CTG CTG ATC GCC ATC CAG ATG ACC GCC 528
Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala
165 170 175

GAG GTG GCC CGC TTC CGC TAC ATC CAG AAC CTG GTG ACC AAG AAC TTC 576
Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe
180 185 190

CCC AAC AAG TTC GAC TCC GAC AAC AAG GTG ATC CAG TTC GAG GTC AGC 624
Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser
195 200 205

TGG CGC AAG ATC TCC ACC GCC ATC TAC GGC GAC GCC AAG AAC GGC GTG 672
Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val
210 215 220

TTC AAC AAG GAC TAC GAC TTC GGC TTC GGC AAG GTG CGC CAG GTG AAG 720
Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys
225 230 235 240

GAC CTG CAG ATG GGC CTG CTG ATG TAC CTG GGC AAG CCC AAG 762
Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250

TAG 765
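
The paired nucleotide and amino acid lines of the listing can be cross-checked by translating each codon with the standard genetic code. The following is a minimal sketch under that assumption; `translate` and `to_three_letter` are hypothetical helper names, and the example input is the final row of SEQ ID NO:79 followed by its TAG stop codon.

```python
from itertools import product

# Standard genetic code, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(seq):
    """Translate a DNA string (whitespace ignored); '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def to_three_letter(protein):
    """Render a one-letter protein string in three-letter form, dropping stops."""
    return " ".join(THREE_LETTER[aa] for aa in protein if aa != "*")


LAST_ROW = "GAC CTG CAG ATG GGC CTG CTG ATG TAC CTG GGC AAG CCC AAG TAG"
PROTEIN = translate(LAST_ROW)
print(PROTEIN)
print(to_three_letter(PROTEIN))
```

The same function can be applied row by row to any coding sequence in the listing to flag codons whose printed translation does not match.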



(2) INFORMATION FOR SEQ ID NO:80:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1233 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1230
(D) OTHER INFORMATION: /product= "E. coli codon optimized FGF-SAP"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:

i5e? aS ^"^ 'iV '^'^ ^"''^ CCG GAG GAC GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
260 265 270

nl III «r ^'"^ AAG GAC CCG AAA CGC CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
275 280 285

TAT TGC AAA AAC GGT GGT TTT TTC CTG CGT ATC CAC CCG GAT GGC CGC 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
290 295 300

5If As! gS If ^ f^' I'' CTG CAA TTG 192
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
305 310

CAA GCA GAG GAA CGC GGT GTT GTA AGC ATC AAG GGC GTT TGC GCG AAT 240
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
320 325 330

CGT TAC CTG GCG ATG AAA GAG GAT GGC CGC CTG CTG GCC TCC AAG TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys
335 340 345

Sit S l^^ "^^ C'^T CTG GAG TCG AAC AAT TAT 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr
355 360 365

Mn fh^ T ? ^''^ "TCG TGG TAC GTA GCA TTG AAA 384
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
375 380

Tra jir nl rtn l""^ ^ P'^ ^^"^ '^^^ '^^ ^^G GGT CCA GGT CAG AAA 432
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
385 390

lu nl VJ^ '^^^ ""^^ ■'CG GCC AAA TCG GCC ATG GTC ACT TCT 480
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser
400 405 410

nl S f ^ ^^'^ CCG ACC GCT GGT CAG TAC AGC TCG TTT 528
Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe
420 425 430

SIf Jcl '^■^ '^^^ CCG AAT TTA AAA TAC GGT 576
Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly
435 440 445

GGC ACG GAT ATT GCA GTG ATT GGC CCG CCG TCT AAG GAA AAG TTC TTG 624
Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu
450 455 460

CGT ATT AAC TTT CAA AGC TCT CGC GGC ACT GTG TCT CTG GGC TTA AAA 672
Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys
465 470 475

CGC GAT AAT TTG TAC GTT GTA GCG TAC CTG GCG ATG GAT AAT ACC AAT 720
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn
480 485 490

GTA AAC CGT GCT TAC TAT TTC AAA AGC GAA ATT ACC TCT GCT GAA CTG 768
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu
495 500 505 510

ACT GCA TTA TTC CCG GAA GCG ACT ACT GCC AAT CAG AAA GCC CTG GAA 816
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu
515 520 525

TAT ACC GAA GAT TAT CAG TCG ATT GAA AAA AAC GCG CAA ATT ACC CAG 864
Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln
530 535 540

GGC GAC AAA TCG CGC AAA GAG TTG GGT CTG GGT ATT GAC CTG CTG CTG 912
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu
545 550 555

ACG TTT ATG GAG GCG GTC AAC AAA AAA GCT CGT GTA GTG AAA AAC GAA 960
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu
560 565 570

GCT CGC TTT CTG CTG ATC GCT ATT CAA ATG ACT GCT GAA GTT GCT CGT 1008
Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg
575 580 585 590

TTC CGT TAC ATT CAG AAC TTG GTT ACT AAG AAC TTT CCG AAC AAA TTC 1056
Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe
595 600 605

GAC TCC GAT AAT AAG GTT ATT CAG TTC GAA GTG AGC TGG CGC AAG ATT 1104
Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile
610 615 620

TCG ACG GCT ATT TAT GGC GAT GCC AAA AAC GGC GTA TTT AAC AAA GAT 1152
Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp
625 630 635

TAT GAC TTC GGT TTT GGC AAG GTT CGT CAG GTG AAA GAT TTG CAG ATG 1200
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met
640 645 650

GGT CTG CTG ATG TAC TTG GGC AAG CCG AAA TAA 1233
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
655 660

(2) INFORMATION FOR SEQ ID NO:81:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 465 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..462
(D) OTHER INFORMATION: /product= "FGF 2 - Ile Mutation at Residue 116"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
415 420 425

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
430 435 440

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC CAC CCC GAC GGC CGA GTT 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg His Pro Asp Gly Arg Val
445 450 455

GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT CAA 192
Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu Gln
460 465 470

GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC CGT 240
Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn Arg
475 480 485 490

TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT GTT 288
Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys Val
495 500 505

ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC AAT 336
Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr Asn
510 515 520

ACT TAC ATA TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA CGA 384
Thr Tyr Ile Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys Arg
525 530 535

ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA GCT 432
Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys Ala
540 545 550

ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC TAA 465
Ile Leu Phe Leu Pro Met Ser Ala Lys Ser
555 560



(2) INFORMATION FOR SEQ ID NO:82:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 465 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..462
(D) OTHER INFORMATION: /product= "FGF 2 - Glu Mutation at Residue 119"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
155 160 165 170

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
175 180 185

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC CAC CCC GAC GGC CGA GTT 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg His Pro Asp Gly Arg Val
190 195 200

GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT CAA 192
Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu Gln
205 210 215

GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC CGT 240
Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn Arg
220 225 230

TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT GTT 288
Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys Val
235 240 245 250

ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC AAT 336
Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr Asn
255 260 265

ACT TAC CGG TCA AGG GAA TAC ACC AGT TGG TAT GTG GCA TTG AAA CGA 384
Thr Tyr Arg Ser Arg Glu Tyr Thr Ser Trp Tyr Val Ala Leu Lys Arg
270 275 280

ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA GCT 432
Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys Ala
285 290 295

ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC TAA 465
Ile Leu Phe Leu Pro Met Ser Ala Lys Ser
300 305



(2) INFORMATION FOR SEQ ID NO:83:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 465 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..462
(D) OTHER INFORMATION: /product= "FGF 2 - Ala Mutation at Residue 120"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
155 160 165 170

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
175 180 185

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC CAC CCC GAC GGC CGA GTT 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg His Pro Asp Gly Arg Val
190 195 200

GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT CAA 192
Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu Gln
205 210 215

GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC CGT 240
Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn Arg
220 225 230

TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT GTT 288
Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys Val
235 240 245 250

ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC AAT 336
Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr Asn
255 260 265

ACT TAC CGG TCA AGG AAA GCA ACC AGT TGG TAT GTG GCA TTG AAA CGA 384
Thr Tyr Arg Ser Arg Lys Ala Thr Ser Trp Tyr Val Ala Leu Lys Arg
270 275 280

ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA GCT 432
Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys Ala
285 290 295

ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC TAA 465
Ile Leu Phe Leu Pro Met Ser Ala Lys Ser
300 305



(2) INFORMATION FOR SEQ ID NO:84:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 465 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..462
(D) OTHER INFORMATION: /product= "FGF 2 - Trp Mutation at Residue 123"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
155 160 165 170

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu
175 180 185

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC CAC CCC GAC GGC CGA GTT 144
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg His Pro Asp Gly Arg Val
190 195 200

GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT CAA 192
Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu Gln
205 210 215

GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC CGT 240
Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn Arg
220 225 230

TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT GTT 288
Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys Val
235 240 245 250
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336 




384 




432 



ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC TAA 465
Ile Leu Phe Leu Pro Met Ser Ala Lys Ser
300 305



(2) INFORMATION FOR SEQ ID NO:85:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Protamine"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
TACATGCCAT GGCCAGGTAC AGATGCTGTC GCAGCCAGAG CCGGAGCAGA TATTACCGCC 60

(2) INFORMATION FOR SEQ ID NO:86:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Protamine"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:
GCAGCTCCGC CTCCTTCGTC TGCGACTTCT TTGTCTCTGG CGGTAATATC TGCTCCGGCT 60

(2) INFORMATION FOR SEQ ID NO:87:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Protamine"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:
GACGAAGGAG GCGGAGCTGC CAGACACGGA GGAGAGCCAT GAGGTGCTGC CGCCCCAGGT 60

(2) INFORMATION FOR SEQ ID NO:88:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 59 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Protamine"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:
ATATATCCTA GGTTAGTGTC TTCTACATCT CGGTCTGTAC CTGGGGCGGC AGCACCTCA 59
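
As printed, SEQ ID NO:85 and SEQ ID NO:86 appear to be complementary at their 3' ends, which would be consistent with oligonucleotides designed to anneal and be extended into a longer duplex. That reading is an inference from the sequences themselves, not a statement made in the listing. The sketch below, which assumes exact-match annealing and uses hypothetical helper names, measures that 3' overlap.

```python
# Illustrative sketch: measure the 3' overlap between two primers on opposite
# strands. Helper names are hypothetical; sequences are SEQ ID NO:85 and
# SEQ ID NO:86 as printed in the listing, with block spacing removed.
SEQ_85 = ("TACATGCCAT GGCCAGGTAC AGATGCTGTC "
          "GCAGCCAGAG CCGGAGCAGA TATTACCGCC").replace(" ", "")
SEQ_86 = ("GCAGCTCCGC CTCCTTCGTC TGCGACTTCT "
          "TTGTCTCTGG CGGTAATATC TGCTCCGGCT").replace(" ", "")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def three_prime_overlap(forward, reverse):
    """Longest suffix of `forward` that equals a prefix of revcomp(`reverse`)."""
    target = reverse_complement(reverse)
    for k in range(min(len(forward), len(target)), 0, -1):
        if forward[-k:] == target[:k]:
            return k
    return 0


OVERLAP = three_prime_overlap(SEQ_85, SEQ_86)
print(OVERLAP, SEQ_85[-OVERLAP:])
```

The same function can be pointed at other adjacent primer pairs in the listing; a result of zero would suggest either a transcription error or that the pair is not meant to anneal at the 3' ends.
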
(2) INFORMATION FOR SEQ ID NO:89:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 66 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:
CGTATCAGGC GGCCGCCGCC ATGGTGACCT CCATCACCCT GGACCTGGTG AACCCCACCG 60
CCGGCC 66

(2) INFORMATION FOR SEQ ID NO:90:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:
TTGGGGTCCT TCACGTTGTT GCGGATCTTG TCCACGAAGG AGGAGTACTG GCCGGCGGTG 60
GGGTTCACC 69

(2) INFORMATION FOR SEQ ID NO:91:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:
AACAACGTGA AGGACCCCAA CCTGAAGTAC GGCGGCACCG ACATCGCCGT GATCGGCCCC 60
CCCTC 65

(2) INFORMATION FOR SEQ ID NO:92:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:
GTGCCGCGGG AGGACTGGAA GTTGATGCGC AGGAACTTCT CCTTGGAGGG GGGGCCGATC 60
ACGGC 65

(2) INFORMATION FOR SEQ ID NO:93:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:
CTCCCGCGGC ACCGTGTCCC TGGGCCTGAA GCGCGACAAC CTGTACGTGG TGGCCTACCT 60
GGCCATGGAC AACAC 75

(2) INFORMATION FOR SEQ ID NO:94:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 77 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:
GCGGTCAGCT CGGCGGAGGT GATCTCGGAC TTGAAGTAGT AGGCGCGGTT CACGTTGGTG 60
TTGTCCATGG CCAGGTA 77
(2) INFORMATION FOR SEQ ID NO:95:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 78 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95:
GCCGAGCTGA CCGCCCTGTT CCCTGAGGCC ACCACCGCCA ACCAGAAGGC CCTGGAGTAC 60
ACCGAGGACT ACCAGTCC 78

(2) INFORMATION FOR SEQ ID NO:96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:
AGCCCGAGCT CCTTGCGGGA CTTGTCGCCC TGGGTGATCT GGGCGTTCTT CTCGATGGAC 60
TGGTAGTCCT CGGTGT 76

(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 74 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97:
TATAGAATTC CTCGGGCTGG GCATCGACCT GCTGCTGACC TTCATGGAGG CCGTGAACAA 60
GAAGGCCCGC GTGG 74

(2) INFORMATION FOR SEQ ID NO:98:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:
CGGCGGTCAT CTGGATGGCG ATCAGCAGGA AGCGGGCCTC GTTCTTCACC ACGCGGGCCT 60
TCTTGTTC 68
(2) INFORMATION FOR SEQ ID NO:99:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 70 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
CGCCATCCAG ATGACCGCCG AGGTGGCCCG CTTCCGCTAC ATCCAGAACC TGGTGACCAA 60
GAACTTCCCC 70

(2) INFORMATION FOR SEQ ID NO:100:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
GGCGGATCCC AGCTGACCTC GAACTGGATC ACCTTGTTGT CGGAGTCGAA CTTGTTGGGG 60
AAGTTCTTGG TCACCA 76
(2) INFORMATION FOR SEQ ID NO:101:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:
CCGGGATCCG TCAGCTGGCG CAAGATCTCC ACCGCCATCT ACGGCGACGC CAAGAACGGC 60
G 61

(2) INFORMATION FOR SEQ ID NO:102:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 64 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:
GCACCTTGCC GAAGCCGAAG TCGTAGTCCT TGTTGAACAC GCCGTTCTTG GCGTCGCCGT 60
AGAT 64

(2) INFORMATION FOR SEQ ID NO:103:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 58 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:
TTCGGCTTCG GCAAGGTGCG CCAGGTGAAG GACCTGCAGA TGGGCCTGCT GATGTACC 58

(2) INFORMATION FOR SEQ ID NO:104:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for Mammalian Codon Preferred Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:
TGAACGTGGC GGCCGCCTAC TTGGGCTTGC CCAGGTACAT CAGCAGGCCC AT 52
(2) INFORMATION FOR SEQ ID NO:105:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for SAP-6"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:
CATATGTGTG TCACATCAAT CACATTAGAT 30

(2) INFORMATION FOR SEQ ID NO:106:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(D) OTHER INFORMATION: /note= "Primer for SAP-6"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:
CAGGTTTGGA TCCTTTACGT T 21

Claims

1. A pharmaceutical composition having the formula:
receptor-binding internalized ligand — nucleic acid binding domain — cytocide-encoding agent, wherein:

receptor-binding internalized ligand is a polypeptide reactive with a cell
surface receptor;

nucleic acid binding domain binds to a nucleic acid, the domain being
conjugated or fused to the receptor-binding internalized ligand;

cytocide-encoding agent is a nucleic acid molecule encoding a cytocide, the
agent being bound to the nucleic acid binding domain; and wherein

the receptor-binding internalized ligand — nucleic acid binding
domain — cytocide-encoding agent binds to the cell surface receptor and internalizes the
cytocide-encoding agent in cells bearing the receptor.

2. A pharmaceutical composition having the formula:
receptor-binding internalized ligand — nucleic acid binding domain — prodrug-encoding agent, wherein:

receptor-binding internalized ligand is a polypeptide reactive with a cell
surface receptor;

nucleic acid binding domain binds to a nucleic acid, the domain being
conjugated or fused to the receptor-binding internalized ligand;

prodrug-encoding agent is a nucleic acid molecule encoding a prodrug, the
agent being bound to the nucleic acid binding domain; and wherein

the receptor-binding internalized ligand — nucleic acid binding domain —
prodrug-encoding agent binds to the cell surface receptor and internalizes the
cytocide-encoding agent in cells bearing the receptor.

3. The composition of either of claims 1 or 2 wherein the receptor-binding
internalized ligand is a polypeptide reactive with an FGF receptor.

4. The composition of either of claims 1 or 2 wherein the receptor-binding
internalized ligand is selected from the group consisting of a polypeptide reactive
with a VEGF receptor and a polypeptide reactive with an HBEGF receptor.

5. The composition of either of claims 1 or 2 wherein the receptor- 
binding internalized ligand is a cytokine. 

6. The composition of claim 1 wherein the cytocide-encoding agent 
encodes a protein that inhibits protein synthesis. 

7. The composition of claim 6 wherein the protein is a ribosome 
inactivating protein. 

8. The composition of claim 7 wherein the ribosome inactivating protein
is saporin.

9. The composition of claim 7 wherein the ribosome inactivating protein
is gelonin.

10. The composition of claim 6 wherein the protein inhibits elongation
factor 2.

11. The composition of claim 10 wherein the protein is diphtheria toxin.

12. The composition of claim 2 wherein the prodrug-encoding agent 
encodes HSV-thymidine kinase. 

13. The composition of either of claims 1 or 2 wherein the growth factor is 
a polypeptide reactive with the FGF receptor and the nucleic acid binding domain is poly-L- 
lysine.

14. The composition of either of claims 1 or 2 wherein the nucleic acid
binding domain is selected from the group consisting of helix-turn-helix motif proteins,
homeodomain proteins, zinc finger motif proteins, steroid receptor proteins, leucine zipper
motif proteins, helix-loop-helix motif proteins, and β-sheet motif proteins.

15. The composition of either of claims 1 or 2 wherein the nucleic acid
binding domain is selected from the group consisting of AP-1, Sp-1, rev, GCN4, λcro, λcI,
TFIIA, myoD, retinoic acid receptor, glucocorticoid receptor, SV40 large T antigen, and
GAL4.

16. The composition of either of claims 1 or 2 wherein the nucleic acid 
binding domain is selected from the group consisting of poly-L-lysine, protamine, histone 
and spermine. 

17. The composition of claim 1 wherein the nucleic acid binding domain 
binds a DNA molecule that encodes a ribosome inactivating protein. 

18. The composition of claim 1 wherein the nucleic acid binding domain 
binds the coding region of saporin DNA. 

19. The composition of claim 1 wherein the cytocide-encoding agent 
further comprises a tissue-specific promoter. 

20. The composition of claim 2 wherein the prodrug-encoding agent 
further comprises a tissue-specific promoter. 

21. The composition of either of claims 19 or 20 wherein the tissue-
specific promoter is selected from the group consisting of alpha-crystalline, tyrosinase,
α-fetoprotein, prostate specific antigen, CEA, α-actin, VEGF receptor, erbB-l, C-myc,
cyclin D, FGF receptor and gamma-crystalline promoter.

22. The composition of any one of claims 1-21, further comprising at least 
one linker that increases the serum stability or intracellular availability of the nucleic acid 
binding domain, the addition of said linker(s) resulting in the formula: 

receptor-binding internalized ligand-(L)_q-nucleic acid binding domain-
cytocide encoding agent or the formula:

receptor-binding internalized ligand-(L)_q-nucleic acid binding domain-
prodrug encoding agent wherein:

L is at least one linker; and 

q is 1 or more, such that the conjugate retains the ability to bind to a cell 
surface receptor and internalize the cytocide-encoding agent, and wherein the cytocide- 
encoding agent is bound to the nucleic acid binding domain. 

23. The composition of claim 22 wherein the linker increases the 
flexibility of the conjugate. 

24. The composition of claim 23 wherein the linker is selected from the
group consisting of (Gly_mSer_p)_n, (Ser_mGly_p)_n and (AlaAlaProAla)_n in which n is 1 to 6, m is 1
to 6 and p is 1 to 4.

25. The composition of claim 24 wherein m is 4, p is 1 and n is 2 to 4. 

26. The composition of claim 22 wherein the linker is a disulfide bond. 

27. A therapeutically effective amount of a pharmaceutical composition 
according to any one of claims 1, 2 or 22, for use in the manufacture of a medicament for 
preventing excessive cell proliferation in the eye, comprising contacting the eye with a cell 
proliferation-inhibiting amount, wherein: 



the inhibited cells are epithelial cells, endothelial cells, fibroblast cells or 
keratocytes; and 

the excessive amount is an amount greater than that required to heal the
surgical wound.

28. A therapeutically effective amount of a pharmaceutical composition 
according to any one of claims 1, 2 or 22, for use in the manufacture of a medicament for
treating cancer, comprising contacting the cancer cells with an amount of the composition 
sufficient for inhibiting proliferation of the cancer cells. 

29. A therapeutically effective amount of a pharmaceutical composition
according to any one of claims 1, 2 or 22, for use in the manufacture of a medicament for 
treating smooth muscle cell hyperplasia, comprising contacting the smooth muscle cells with 
an amount of the composition sufficient for inhibiting hyperplasia of smooth muscle cells. 



30. A pharmaceutical composition having the formula: 
receptor-binding internalized ligand-cytocide-encoding agent nucleic acid
binding domain, wherein:

receptor-binding internalized ligand is a polypeptide reactive with a cell 
surface receptor; 

cytocide-encoding agent is a nucleic acid molecule encoding a cytocide, the
agent being conjugated to the receptor-binding internalized ligand; and wherein the cytocide-
encoding agent is bound to the nucleic acid binding domain; and wherein the receptor-
binding internalized ligand-cytocide-encoding agent nucleic acid binding domain binds to the
cell surface receptor and is internalized in cells bearing the receptor.

31. The composition of claim 30 wherein the nucleic acid binding domain 
is poly-L-lysine. 

32. The composition of claim 30 wherein the receptor binding internalized
ligand is a polypeptide reactive with an FGF receptor. 